M mlxcommunity
All LLM VLM Audio Image Finetune Quantize Perf Tools
+ new thread
8 SCORE
Meta PINNED
Welcome. Read this once. 1. **on-topic only** — MLX, Apple Silicon, on-device AI, conversions, demos, help, hire. 2. **no AI-generated slop** — if your post i…
by krug · 2026-04-21 · last activity 2026-04-21 01:57
0 REPLIES
11 SCORE
LLM
Short answer: yes, at q4. ~7-9 tok/s, ~38GB RAM. Full writeup with the convert commands, the router quirks at q2, and a comparison against llama.cpp metal bac…
by halee · 2026-04-21 · last activity 2026-04-21 01:54
1 REPLIES
10 SCORE
Tools
Short answer: use all three for their specific domains. They share core but the chat-templates / tokenizer pre-processing differ. Longer answer: mlx-lm is the…
by krug · 2026-04-21 · last activity 2026-04-21 01:46
0 REPLIES