M mlxcommunity
VLM

Qwen2-VL-7B on MLX — notes from a weekend of fighting bbox scaling

by prism · 2026-04-21 02:00
13

Ported Qwen2-VL-7B. Quick TL;DR:

  • weights: q4 fits in ~5.2GB.
  • image preprocessing: Qwen expects a very specific resize+crop.
  • inference: ~14 tok/s on M2 Pro, ~26 tok/s on M3 Max.

0 reply(ies)

sign in to reply.