Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Task Execution on TEXTCRAFT-SYNTH 8K context Hard (evaluation set)
Loading...
88
Success Rate
Recursive
-3.52
20.24
44
67.76
May 7, 2026
Success Rate
Steps Taken
Time (s)
Updated 26d ago
Evaluation Results
Method
Method
Links
Success Rate
Steps Taken
Time (s)
Recursive
Context Window=8K trai...
2026.05
88
-
-
Single
Context Window=8K trai...
2026.05
0
-
-
Feedback
Search any
task
Search any
task