M mlxcommunity
All LLM VLM Audio Image Finetune Quantize Perf Tools
+ new thread
11 SCORE
LLM
Short answer: yes, at q4. ~7-9 tok/s, ~38GB RAM. Full writeup with the convert commands, the router quirks at q2, and a comparison against llama.cpp metal bac…
by halee · 2026-04-21 · last activity 2026-04-21 01:54
1 REPLIES