Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Mathematical Reasoning on AIME 24 (pass@1, Avg Rank)
Loading...
33.33
Pass@1
ARPO
-1.3332
7.6659
16.665
25.6641
Apr 9, 2026
Pass@1
Average Rank
Updated 9d ago
Evaluation Results
Method
Method
Links
Pass@1
Average Rank
ARPO
2026.04
33.33
3.57
SEARL
2026.04
33.33
1.43
GRPO
2026.04
13.33
2.43
DAPO
2026.04
13.33
3
Reinforce++
2026.04
10
4.57
TIR Prompt
2026.04
0
5.29
Feedback
Search any
task
Search any
task