Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Inference Throughput on Llama-8B

115.2Throughput (Tokens/s)

GPTQ

45.83263.84181.8599.859Jan 29, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.01
115.2
2026.01
113.1
2026.01
112.8
2026.01
48.5