Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Mathematical Reasoning on MATH (Pass@1, Pass@2)
Loading...
88.42
Pass@1
Phi-4-mini + Mistral3-3B
52.8416
62.0783
71.315
80.5517
Jan 29, 2026
Pass@1
Pass@2
Updated 3d ago
Evaluation Results
Method
Method
Links
Pass@1
Pass@2
Phi-4-mini + Mistral3-3B
Training Framework=COR...
2026.01
88.42
92.08
Phi-4-mini-reasoning
Training Framework=COR...
2026.01
85.24
90.01
Mistral3-3B-Reasoning
Training Framework=COR...
2026.01
83.72
89.12
Phi-4-mini + Mistral3-3B + Oracle
Training Framework=SD-...
2026.01
71.85
77.7
Phi-4-mini-reasoning
Training Framework=SD-...
2026.01
69.68
73.82
Mistral3-3B-Reasoning
Training Framework=SD-...
2026.01
67.25
73.15
Phi-4-mini + Mistral3-3B + Oracle
Training Framework=Bas...
2026.01
59.25
70.7
Phi-4-mini-reasoning
Training Framework=Bas...
2026.01
57.04
68.16
Mistral3-3B-Reasoning
Training Framework=Bas...
2026.01
54.21
64.44
Feedback
Search any
task
Search any
task