
LLM Decoding on ShareGPT

Latency (ms/token): 2.4

Llama2-7B

[Chart: axis ticks 1.484, 7.667, 13.85, 20.033; date label Mar 12, 2026]
Updated 2mo ago

Evaluation Results

Method    Links
2026.03   2.4    12.9
2026.03   3.1    13.8
2026.03   8.5    13.1
2026.03   8.6    13.3
2026.03   25.3   26.3