Mathematical Reasoning on AIME 24 (ACC, LEN)

43.75 Accuracy

SFPO

[Plot: accuracy over time, Jun 9, 2025 to Oct 5, 2025]
Updated 1mo ago

Evaluation Results

Date     ACC    LEN
2025.10  43.75  -
2025.10  42.5   -
2025.10  34.17  -
2025.10  32.92  -
2025.10  25.42  -
2025.10  20.4   -
2025.06  19.1   5,522
2025.06  15.7   7,786
2025.06  15.5   1,784
2025.06  14.3   3,691
2025.06  12.7   7,925
2025.06  11.5   1,939
2025.06  10.3   4,042
2025.06  9.8    2,425
2025.06  6.2    1,632
2025.06  5.6    1,396
2025.06  2.8    1,971
2025.06  1.8    940
2025.06  1.3    1,236
2025.06  0.8    2,033