Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
End-to-end single-step decoding on DeepSeek-R1-Distill-LLaMA-8B 64K Context
Loading...
24.1
Latency (ms)
LessIsMore
23.688
26.469
29.25
32.031
Aug 9, 2025
Latency (ms)
Updated 1mo ago
Evaluation Results
Method
Method
Links
Latency (ms)
LessIsMore
Token Budget=2K, Servi...
2025.08
24.1
Quest
Token Budget=2K, Servi...
2025.08
24.8
TidalDecode
Token Budget=2K, Servi...
2025.08
25.4
Full Attention
Token Budget=2K, Servi...
2025.08
34.4
Feedback
Search any
task
Search any
task