Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Mathematical Reasoning on MGSM

91.7Accuracy

Qwen-3-32B

0.49224.17147.8571.529May 17, 2023Oct 29, 2023Apr 12, 2024Sep 25, 2024Mar 10, 2025Aug 23, 2025Feb 5, 2026
Updated 3d ago

Evaluation Results

MethodLinks
2026.02
91.7-
2024.07
91.6-
2024.07
91.6-
2026.02
91.6-
2024.07
90.5-
2026.02
90.5-
2026.02
90-
2026.02
89.6-
2025.02
89.3-
89.01-
2026.02
88.4-
2025.02
88.16-
2026.02
87.4-
87.36-
2024.07
86.9-
2026.02
86-
2024.07
85.9-
2025.02
82.27-
2025.02
78.37-
2026.02
76.6-
2026.02
76.1-
2023.05
75.9-
2026.02
73-
2026.02
72.7-
2023.05
72.2-
2026.02
71.9-
2024.07
71.1-
2024.07
68.9-
2026.02
67.3-
2025.02
66.9-
2025.02
60.8-
2026.02
60.5-
2023.05
60.4-
2025.02
59-
2026.02
58.9-
2026.01
58.44-
2024.07
53.2-
2026.01
53.2-
2026.01
52-
2024.07
51.4-
2023.05
49.9-
2026.01
44-
2024.10
42-
2024.10
42-
2024.10
42-
2024.10
42-
2024.10
42-
2024.10
42-
2024.10
42-
2024.10
42-
2024.10
42-
2024.10
42-
2024.10
42-
2024.10
42-
2024.10
40-
2024.10
39-
2024.10
38-
2024.10
38-
2024.10
38-
2024.10
38-
2024.10
38-
2024.10
38-
2024.10
38-
2024.10
38-
2024.10
38-
2024.10
37-
2024.10
37-
2024.10
34-
2026.01
33.2-
2024.10
33-
2024.10
33-
2024.10
32-
2024.10
32-
2024.10
32-
2024.10
32-
2024.10
32-
2026.01
31.6-
2024.07
29.9-
2024.10
24-
2024.10
24-
2024.10
23-
2026.01
17.2-
2026.01
14.4-
2026.01
10.4-
2026.01
10-
2026.01
8.4-
2026.01
7.6-
2024.10
7-
2024.10
6-
2024.10
6-
2024.10
6-
2024.10
6-
2024.10
6-
2024.10
6-
2024.10
6-
2024.10
5-
2024.10
5-
2024.10
4-
2024.10
4-
2024.10
4-
Showing 100 of 118 rows