Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Mathematical Reasoning on AMC 2023

96.02Accuracy

JustRL-Nemotron

33.599249.804666.0182.2154Feb 17, 2025Apr 23, 2025Jun 28, 2025Sep 1, 2025Nov 6, 2025Jan 10, 2026Mar 17, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2025.12
96.02--
2025.12
93.44--
2025.12
92.3--
2025.12
91.9--
2025.12
91.02--
2025.12
90.6--
2025.12
90.55--
2025.06
90-1,229.25
2025.06
90-1,942.4
2026.02
89.69-3,260
2026.02
89.45-2,894
2025.06
89.3--
2025.06
89.3--
2025.06
89.3--
2025.06
89.3--
2026.03
89--
2026.03
89--
2026.02
88.9-5,295
2025.12
88.8--
2025.12
88.75--
2025.06
88.2--
2026.02
88.12-3,560
2025.06
88--
2026.02
87.66-5,509
2026.02
87.5-5,980
2026.02
86.88-3,641
2026.02
86.41--
2026.02
85.8-6,005
2025.06
85-1,267.83
2025.06
85-2,158
2025.06
85-1,932.63
2026.02
84.53--
2025.12
84.3--
2025.06
84--
2025.06
83--
2026.03
83--
2026.03
83--
2025.12
82.3--
2025.06
82--
2026.03
82--
2026.02
81.88--
2025.06
81-1,300.53
2025.06
81-1,951.88
2025.06
80--
2026.02
79.53--
2025.12
79.3--
2025.06
79--
2026.03
78--
2026.03
78--
2026.02
75.94-3,899
2025.06
75-2,812.75
2025.12
73.83--
2025.12
73.7--
2026.02
73.28-3,037
2026.02
72.1-2,798
2026.02
70.47-7,046
2026.02
69.53-7,091
2026.02
67.66-7,657
2026.02
67.19-3,342
2026.02
66.72-7,742
2026.02
66.41--
2025.06
65.2--
2025.06
65-1,839.23
2025.12
63.82--
2025.06
63.5--
2025.06
63-1,855.85
2025.06
63-1,890.35
2025.06
61.6--
2025.06
61.6--
2026.02
61.25-1,179
2025.06
61.2--
2026.03
61--
2026.03
60--
2026.02
59.53--
2025.02
57.5--
2025.09
55.62--
2026.03
55--
2025.06
53--
2025.06
51.1--
2025.06
50.6--
2025.09
50.31--
2025.02
50--
2026.02
49.22-1,377
2025.06
49--
2025.06
48.6--
2025.02
47.5--
2025.02
47.5--
2026.01
47.03--
2025.06
47--
45.31--
2025.06
45-3,263.75
2025.06
43.2--
2025.06
42.4--
2025.06
42.4--
2026.02
42.34--
2025.09
40.31--
2025.02
40--
37.81--
2026.01
37.34--
2026.03
36--
Showing 100 of 138 rows