Quantize

Q2 vs Q4 vs Q8 on Llama-3.1-8B — actual numbers

by tito · 2026-04-21 02:00

6

OP · tito

Ran the same eval suite across three quantizations of Llama-3.1-8B on M2 Pro.

q4 is the sweet spot.

0 reply(ies)

sign in to reply.