
Mathematical Reasoning on MATH-500 OpenR1 Harder

96.6 Accuracy

Qwen-4B

[Chart: Accuracy over time; Feb 11, 2026]
Updated 4d ago

Evaluation Results

Date      Accuracy
2026.02   96.6
2026.02   96.6
2026.02   93.8