Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Inference Throughput on LLaMA-3 8B

1,020Decode Throughput (tok/s)

IQ3_S

458.4604.2750895.8Mar 30, 2026
Updated 18d ago

Evaluation Results

MethodLinks
2026.03
1,02047,8002.1
2026.03
96051,2002
2026.03
89042,1001.9
2026.03
48028,4001