Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Mathematical Problem Solving on MATH500

93Accuracy

Self-MoA

21.2439.8758.577.13May 16, 2025Jul 12, 2025Sep 8, 2025Nov 5, 2025Jan 2, 2026Mar 1, 2026Apr 28, 2026
Updated 16d ago

Evaluation Results

MethodLinks
2025.07
93-----
2025.07
92.6-----
2025.07
90.4-----
2025.07
90.4-----
2025.07
90.2-----
2025.07
90-----
2025.07
90-----
2026.04
88.2-893-2,320-
2025.07
88-----
2025.07
88-----
2025.07
87.8-----
2025.07
87.8-----
2026.04
87.1-953-1,974-
2025.05
86.87----2,666.58
2025.05
86.73----5,387.19
2026.04
86.3-816-1,701-
2026.04
85.8-4,100-6,010-
2025.07
85.8-----
2025.11
85.6-----
2025.07
85.6-----
2025.11
84.4-----
2026.04
84.4-2,794-3,958-
2025.07
84.4-----
2025.07
84.38-----
2026.04
84.2-1,471-2,401-
2025.07
84-----
2025.11
83.2-----
2025.11
82.8-----
2025.07
82.8-----
2026.02
80.6-----
2025.05
79.73----582.58
2025.07
78.8-----
2025.05
78.47----2,326.85
2026.02
78.4-----
2026.04
77.8-519-1,360-
2025.05
76.73----1,753.42
2026.04
76.6-495-1,096-
2025.11
76.1-----
2026.04
75.8-523-825-
2025.07
75.6-----
2025.07
75.2-----
2025.05
74.93----5,327.12
2025.07
74.6-----
2025.11
74.4-----
2025.07
74.2-----
2025.07
73.6-----
2025.07
73.2-----
2025.11
73-----
2025.07
73-----
2025.07
73-----
2026.04
72.5-2,117-2,204-
2025.05
72.47----2,088.44
2026.04
71.9-1,185-1,368-
2025.07
70-----
2026.04
69.1-3,059-2,952-
2025.11
68.8-----
2025.11
68.4-----
2025.11
61.8-----
2025.05
61.13----823.89
2025.11
60-----
2025.07
55.2-----
2025.11
54.2-----
2026.02
53-----
2026.02
51.2-----
2025.11
51.2-----
2025.11
48.8-----
2025.11
47.2-----
2026.01
40.6558.7682.281.91,968-
2026.01
40.4585.81,02457.23,663-
2026.01
40706.52,04834.512,764-
2026.01
40464.6618.375.22,207-
2026.01
38.4423.951282.81,202-
2026.01
37428.8486.388.21,398-
2025.05
35.53----1,499.54
2026.01
35.4244.725695.6462-
2025.09
33-----
2025.11
32.8-----
2025.09
31.8-----
2025.09
31.4-----
2025.09
30-----
2026.01
29123.312896.3208-
2025.11
27.3-----
2026.01
2461.76496.498-