LLM Decoding on Llama-2-70B

Per-step Decoding Latency: 0.2163

Pre3

[Chart: Per-step Decoding Latency over time. Y-axis ticks: 0.211712, 0.242681, 0.27365, 0.304619. X-axis: Jun 4, 2025]
Updated 4d ago

Evaluation Results

Method | Links | Date | Per-step Decoding Latency
 | | 2025.06 | 0.2163
 | | 2025.06 | 0.2407
 | | 2025.06 | 0.303
 | | 2025.06 | 0.331