
Accuracy on AMC Math Reasoning

[Chart: Accuracy over time (series: LLM-J), Oct 5, 2025 to Mar 9, 2026]
Updated 19d ago

Evaluation Results

Date      Accuracy
2025.11   80
2025.11   80
2025.11   73.3
2025.11   73.3
2025.11   70.7
2025.11   70.7
2025.11   68.5
2025.11   68
2025.11   68
2025.11   67.4
2025.11   67.4
2025.11   67
2025.11   66.7
2025.11   66.7
2025.11   66.7
2025.11   66.7
2025.11   66
2025.11   66
2025.11   65.5
2025.11   65
2025.11   63.3
2025.11   62
2025.11   61.2
2025.11   61.2
2025.11   61
2025.11   60.8
2025.11   60.8
2025.11   60.5
2025.11   60
2025.11   60
2025.11   60
2025.11   60
2025.11   60
2025.11   60
2025.11   60
2025.11   50
2025.11   48
2025.11   45
2025.11   40
2025.10   38.6
2025.10   36.1
2026.03   35.83
2025.11   35
2025.10   34.9
2025.10   33.7
2025.10   33.7
2025.11   33.3
2025.11   33.3
2025.10   32.5
2025.10   30.1
2025.11   30
2025.11   30
2025.10   26.5
2025.10   25.3
2025.10   24.1
2025.11   23.3
2025.11   23.3
2025.11   21.6
2025.11   21.6
2025.10   21.2
2026.03   20.48
2025.11   20.2
2025.11   20.2
2025.11   20.1
2025.11   20.1
2025.11   20.1
2025.11   20.05
2025.11   20
2025.11   20
2025.11   20
2025.11   20
2026.03   19.68
2026.03   19.28
2026.03   16.47
2025.10   15.7
2026.03   15.66
2026.03   15.66
2025.11   13.3
2025.11   13.3
2026.03   12.45
2026.03   12.05
2025.10   12
2026.03   11.65
2025.10   8.4
2026.03   8.03
2026.01   7.6
2026.01   6.8
-         5.5
2026.01   5.1
2026.01   5.1
2026.01   4
-         2.7
2026.01   2.3
-         2.1
2026.01   1.4