Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Inference Throughput on Llama-3B

215.6Throughput (TOK/s)

GPTQ

114.512140.756167193.244Jan 29, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.01
215.6
2026.01
210
2026.01
209.5
2026.01
118.4