Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Mathematical and General Reasoning on DeepMATH (test)

83.4MATH 500 Score

BF16

56.77663.68870.677.512Jan 20, 2026
Updated 3d ago

Evaluation Results

MethodLinks
2026.01
83.4-44.236.154.6
2026.01
80.2-47.233.853.7
2026.01
69.763.446.231.852.8
2026.01
57.8-42.632.644.3