6
SCORE
Quantize
Ran the same eval suite across three quantizations of Llama-3.1-8B on M2 Pro.
| quant | mem | tok/s | HumanEval | MMLU-redux |
|-------|------|-------|------…
0
REPLIES