Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Agent planning on WAH-NL
Loading...
20.48
Sub-SR
LoTA (Full Recompute)
8.624
11.702
14.78
17.858
Feb 27, 2026
Sub-SR
TTFT (s)
Updated 1mo ago
Evaluation Results
Method
Method
Links
Sub-SR
TTFT (s)
LoTA (Full Recompute)
Model=Qwen-2.5-32B (INT4)
2026.02
20.48
1.221
LoTA + KEEP
Model=Qwen-2.5-32B (INT4)
2026.02
20.25
0.601
LoTA (Full Recompute)
Model=Qwen-2.5-14B
2026.02
19.58
0.362
LoTA + KEEP
Model=Qwen-2.5-14B
2026.02
19.52
0.226
LoTA + CacheBlend
Model=Qwen-2.5-32B (INT4)
2026.02
16.88
1.148
LoTA + CacheBlend
Model=Qwen-2.5-14B
2026.02
16.21
0.341
LoTA + Full Reuse
Model=Qwen-2.5-32B (INT4)
2026.02
12.74
0.578
LoTA + Full Reuse
Model=Qwen-2.5-14B
2026.02
9.08
0.203
Feedback
Search any
task
Search any
task