Task Execution on TEXTCRAFT-SYNTH 8K context Medium (evaluation set)
[Chart: SR by date. Plotted point: Recursive, SR 96, May 7, 2026. Selectable metrics: SR, Steps, Time (s).]
Updated 26d ago
Evaluation Results
Method     Configuration              Date     SR  Steps  Time (s)
Recursive  Context Window=8K trai...  2026.05  96  52     13.6
Single     Context Window=8K trai...  2026.05  17  25     11.1