Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Context Length

Benchmarks

Task NameDataset NameSOTA ResultTrend
Inference Efficiency90k Context Length Llama-3.1-8B
Throughput (queries/s)8.9
4
Inference Efficiency30k Context Length (Llama-3.1-8B)
Inference Throughput (QPS)15.8
4
Inference Efficiency30k Context Length Llama-2-7B
Inference Throughput (QPS)6.6
4
Showing 3 of 3 rows