Long-context reasoning on OOLONG-REAL Average 650 samples

0.32Average Reward

Recursive Agent Optimization

Updated 2mo ago

Evaluation Results

Method	Links
Recursive Agent Optimization 2026.05		0.32
Recursive Agent Optimization 2026.05		0.315
Single Agent 2026.05		0.203
Single Agent 2026.05		0.183