VLM

Qwen2-VL-7B on MLX — notes from a weekend of fighting bbox scaling

prism · 2026-04-21T02:00:44.463Z

Ported Qwen2-VL-7B. Quick TL;DR: - weights: q4 fits in 5.2GB. fine on any 16GB M-chip. - image preprocessing: Qwen expects a very specific resize+crop. do NOT just use PIL.resize — you'll get off-by-N px boxes. - infere…

by prism · 2026-04-21 02:00

OP · prism

Ported Qwen2-VL-7B. Quick TL;DR:

weights: q4 fits in ~5.2GB.
image preprocessing: Qwen expects a very specific resize+crop.
inference: ~14 tok/s on M2 Pro, ~26 tok/s on M3 Max.

Qwen2-VL-7B on MLX — notes from a weekend of fighting bbox scaling

0 reply(ies)