
AIME

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Mathematical Reasoning | AIME 2024 | Accuracy 94 | 370 |
| Mathematical Reasoning | AIME | AIME Accuracy 83.3 | 288 |
| Mathematical Reasoning | AIME 2025 | Accuracy 95 | 227 |
| Mathematical Reasoning | AIME 25 | Accuracy 94.5 | 201 |
| Mathematical Reasoning | AIME 2024 | Pass@1 Accuracy 83.33 | 165 |
| Mathematical Reasoning | AIME24 | Accuracy 97.3 | 160 |
| Mathematical Reasoning | AIME 2024 (test) | Accuracy 96.67 | 159 |
| Mathematical Reasoning | AIME 24 | Accuracy 75.52 | 154 |
| Mathematical Reasoning | AIME 2024 | Accuracy 77.78 | 151 |
| Mathematical Reasoning | AIME 2025 | Pass@1 Accuracy 98.6 | 118 |
| Mathematical Reasoning | AIME 24 | Accuracy 71.1 | 113 |
| Mathematical Reasoning | AIME 2024 | Accuracy 88.67 | 104 |
| Mathematical Reasoning | AIME 2025 | Pass@1 80.78 | 96 |
| Mathematical Reasoning | AIME 2024 | Pass@1 83.8 | 86 |
| Mathematical Reasoning | AIME 24 | AIME 24 Accuracy 93.3 | 84 |
| Mathematical Reasoning | AIME24 | Pass@1 Accuracy 78.9 | 82 |
| Mathematical Reasoning | AIME 2025 | Acc 77 | 81 |
| Mathematics | AIME 2025 | Accuracy 76.7 | 66 |
| Mathematical Reasoning | AIME 25 | pass@1 80 | 65 |
| Mathematical Reasoning | AIME 24/25 | Accuracy 35.3 | 64 |
| Mathematics | AIME25 | Accuracy 93.33 | 63 |
| Mathematical Reasoning | AIME 2025 (test) | Pass@1 Rate 88.9 | 63 |
| Mathematical Problem Solving | AIME 2024 | Accuracy 100 | 62 |
| Mathematics | AIME 2024 | Accuracy 83.8 | 60 |
| Mathematical Reasoning | AIME 2024 | Mean Score (k=8) 68.3 | 59 |
Showing 25 of 613 rows
...