Inference Throughput on Llama-3B

[Chart: Throughput (tok/s) by date. Labeled point: GPTQ, 215.6 tok/s; Jan 29, 2026]
Updated 4d ago

Evaluation Results

Date      Throughput (tok/s)
2026.01   215.6
2026.01   210
2026.01   209.5
2026.01   118.4