
Llama-3.1-8B

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
Decoding Latency | Llama-3.1-8B 16k sequence length v1 (inference) | Decoding latency (s): 0.024 | 8