Inference Throughput on Llama-8B

[Chart: Throughput (Tokens/s). Highlighted point: GPTQ, 115.2, Jan 29, 2026]
Updated 4d ago

Evaluation Results

Date      Throughput (Tokens/s)
2026.01   115.2
2026.01   113.1
2026.01   112.8
2026.01   48.5