Competitive Programming Agent Evaluation on ALE Bench
[Chart: Final Performance by method. Highlighted point: Leeroo, 1,909.4 (Jan 29, 2026). Selectable metrics: Final Performance, Rank Percentile, Total Cost ($).]
Updated 3mo ago
Evaluation Results
Method          Model           Date     Final Performance  Rank Percentile  Total Cost ($)
Leeroo          Gemini-2.5-pro  2026.01  1,909.4            6.1              914.8
ALE-Agent       Gemini-2.5-pro  2026.01  1,879.3            6.8              1,003.3
ALE Sequential  Gemini-2.5-pro  2026.01  1,198              54.1             111
ALE one-shot    Gemini-2.5-pro  2026.01  832                88.4             4.7