Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Mathematical Reasoning on MathInstruct Scenario 1

68.4Accuracy

LaDa

15.77629.43843.156.762Feb 21, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.02
68.49.6
2026.02
68.4-
2026.02
58.8-
2026.02
58.2-
2026.02
54.4-
2026.02
52.21
2026.02
52.2-
2026.02
51.6-
2026.02
51.2-
2026.02
48.62.4
2026.02
470.6
2026.02
47-
2026.02
46.4-
2026.02
46.2-
2026.02
46-
2026.02
46-
2026.02
45.92.6
2026.02
45.9-
2026.02
45.4-
2026.02
44-
2026.02
43.5-
2026.02
43.4-
2026.02
43.3-
2026.02
43.3-
2026.02
43.3-
2026.02
43.2-
2026.02
43.1-
2026.02
42.7-
2026.02
41.8-
2026.02
41.2-
2026.02
41-
2026.02
38.6-
2026.02
37.6-
2026.02
30.8-
2026.02
30.61.2
2026.02
29.4-
2026.02
28.5-
2026.02
27.5-
2026.02
27.20.8
2026.02
26.4-
2026.02
26.2-
2026.02
26-
2026.02
25.83
2026.02
25.8-
2026.02
24-
2026.02
22.8-
2026.02
22.6-
2026.02
22-
2026.02
22-
2026.02
21.4-
2026.02
21.4-
2026.02
19.8-
2026.02
17.8-