Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Inference Throughput on Llama-1B

310.5Throughput (Tokens/sec)

GPTQ

200.988229.419257.85286.281Jan 29, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.01
310.5
2026.01
303.1
2026.01
302.8
2026.01
205.2