
Inference Throughput on Llama Instruct 3.1-8B (internal harness)

Throughput (TPS): 6,991

Pre-compressed only

[Chart: Throughput (TPS) over time. Y-axis ticks: 784.28, 2,395.64, 4,007, 5,618.36. X-axis date: Mar 18, 2026.]
Updated 1mo ago

Evaluation Results

Date       Throughput (TPS)
2026.03    6,991
2026.03    6,872
2026.03    4,606
2026.03    3,316
2026.03    2,194
2026.03    2,194
2026.03    1,812
2026.03    1,023