Long-context reasoning on OOLONG-REAL Average 650 samples
Best Average Reward: 0.32 (Recursive Agent Optimization)
[Chart: Average Reward over time; data point dated May 7, 2026]
Updated 26d ago
Evaluation Results
Method                        Steps  Time (s)  Date     Average Reward
Recursive Agent Optimization  61.5   1...      2026.05  0.32
Recursive Agent Optimization  61.5   1...      2026.05  0.315
Single Agent                  7.1    12.6      2026.05  0.203
Single Agent                  7.1    12.6      2026.05  0.183