Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Mathematical Problem Solving on AIME 2025 (Top-1 Accuracy)

26.97Top-1 Accuracy (%)

ePF w/ LaM

1.62528.205114.78521.3649Oct 7, 2025
Updated 18d ago

Evaluation Results

MethodLinks
2025.10
26.97---
2025.10
25.16---
2025.10
23.13---
2025.10
22.99---
2025.10
22.06---
2025.10
20.8---
2025.10
20.25---
2025.10
19.6---
2025.10
19.22---
2025.10
19.1---
2025.10
19.09---
2025.10
19.03---
2025.10
18.6---
2025.10
18.45---
2025.10
18.4---
2025.10
18.4---
2025.10
18.38---
2025.10
17.83---
2025.10
16.4---
2025.10
15.8---
2025.10
13.55---
2025.10
13.35---
2025.10
12.61---
2025.10
11.54---
2025.10
9.8---
2025.10
2.6---
2025.10
--3.33-
2025.10
-5.133.62.52
2025.10
-9.457.44.32
2025.10
-7.324.52.87
2025.10
-10.827.283.42
2025.10
--6.66-
2025.10
-17.4115.814.9
2025.10
-14.1916.216.81
2025.10
-21.6119.817.61
2025.10
-28.8325.121.96