Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
End-to-end single-step decoding on DeepSeek-R1-Distill-LLaMA-8B 16K Context
Loading...
23
Latency (ms)
LessIsMore
22.908
23.529
24.15
24.771
Aug 9, 2025
Latency (ms)
Updated 1mo ago
Evaluation Results
Method
Method
Links
Latency (ms)
LessIsMore
Token Budget=2K, Servi...
2025.08
23
Quest
Token Budget=2K, Servi...
2025.08
24.2
TidalDecode
Token Budget=2K, Servi...
2025.08
24.3
Full Attention
Token Budget=2K, Servi...
2025.08
25.3
Feedback
Search any
task
Search any
task