Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Mathematical Reasoning on MATH500 (pass@1 accuracy)
Loading...
82.2
Pass@1 Accuracy
AERO
67.64
71.42
75.2
78.98
Feb 3, 2026
Pass@1 Accuracy
Updated 3d ago
Evaluation Results
Method
Method
Links
Pass@1 Accuracy
AERO
Backbone Model=Qwen3-8...
2026.02
82.2
R-Zero
Backbone Model=Qwen3-8...
2026.02
82
AERO
Backbone Model=Qwen3-8...
2026.02
81.8
AERO
Backbone Model=Qwen3-8...
2026.02
79.8
AERO
Backbone Model=Qwen3-8...
2026.02
79.4
AERO
Backbone Model=Qwen3-8...
2026.02
79.2
Qwen3-8B-Base
Backbone Model=Qwen3-8...
2026.02
78
Absolute Zero
Backbone Model=Qwen3-8...
2026.02
76.6
Absolute Zero
Backbone Model=Qwen3-4...
2026.02
76.2
R-Zero
Backbone Model=Qwen3-4...
2026.02
74.8
AERO
Backbone Model=Qwen3-4...
2026.02
74.8
AERO
Backbone Model=Qwen3-4...
2026.02
74.4
AERO
Backbone Model=Qwen3-4...
2026.02
73.8
AERO
Backbone Model=Qwen3-4...
2026.02
72.6
AERO
Backbone Model=Qwen3-4...
2026.02
70.8
Qwen3-4B-Base
Backbone Model=Qwen3-4...
2026.02
68.2
Feedback
Search any
task
Search any
task