Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Task Execution on TEXTCRAFT-SYNTH All (eval)
Loading...
96
Success Rate
Recursive Agent
72.08
78.29
84.5
90.71
May 7, 2026
Success Rate
Steps
Time (s)
Updated 26d ago
Evaluation Results
Method
Method
Links
Success Rate
Steps
Time (s)
Recursive Agent
Context Window (Train/...
2026.05
96
115
19.8
Single Agent
Context Window (Train/...
2026.05
73
54
35.7
Feedback
Search any
task
Search any
task