Mathematical Reasoning on AMC (Pass@1 Avg@32)
[Chart: Pass@1 (Avg@32) over time. Highlighted point: Fold-RL, 73.8, Feb 3, 2026.]
Evaluation Results
| Method | Model Backbone | Date | Pass@1 (Avg@32) |
| --- | --- | --- | --- |
| Fold-RL | Qwen2.5... | 2026.02 | 73.8 |
| Unfold-RL | Qwen3-4... | 2026.02 | 73.2 |
| Mix-RL | Qwen3-4... | 2026.02 | 72.8 |
| Fold-RL | Qwen3-4... | 2026.02 | 72.2 |
| Mix-RL | Qwen2.5... | 2026.02 | 71.9 |
| Unfold-RL | Qwen2.5... | 2026.02 | 71.2 |
| Unfold-RL | Qwen2.5... | 2026.02 | 70.2 |
| Unfold-RL | Qwen3-4... | 2026.02 | 69.7 |
| Cold-Start | Qwen2.5... | 2026.02 | 65.4 |
| Zero-RL | Qwen3-4... | 2026.02 | 65.4 |
| Cold-Start | Qwen3-4... | 2026.02 | 64.1 |
| Cold-Start | Qwen2.5... | 2026.02 | 62.4 |
| Zero-RL | Qwen2.5... | 2026.02 | 58.9 |
| Cold-Start | Qwen3-4... | 2026.02 | 57.3 |
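
This page does not define Pass@1 (Avg@32). One common reading, assumed here, is that 32 answers are sampled per problem, each answer is graded as correct or incorrect, and the score is the fraction of correct answers averaged over samples and then over problems, reported as a percentage. The sketch below illustrates that reading only; the function name `pass_at_1_avg_at_k` and the toy data are hypothetical and are not taken from the leaderboard or from any of the listed methods.

```python
def pass_at_1_avg_at_k(outcomes):
    """Mean per-sample accuracy, averaged over problems, as a percentage.

    outcomes: one list per problem, holding one boolean per sampled
    answer (True if that answer was graded correct). For Avg@32 each
    inner list would hold 32 entries. This is an assumed definition,
    not one confirmed by the leaderboard.
    """
    if not outcomes:
        raise ValueError("need at least one problem")
    per_problem = []
    for samples in outcomes:
        if not samples:
            raise ValueError("each problem needs at least one sample")
        # Fraction of this problem's sampled answers that were correct.
        per_problem.append(sum(1 for ok in samples if ok) / len(samples))
    # Average the per-problem fractions and scale to a percentage.
    return 100.0 * sum(per_problem) / len(per_problem)


# Toy data: 2 problems with 4 samples each (a real run would use 32).
toy = [
    [True, True, False, False],  # 2 of 4 correct
    [True, True, True, True],    # 4 of 4 correct
]
score = pass_at_1_avg_at_k(toy)
print(score)
```

Averaging over many samples per problem reduces the variance of a single-sample Pass@1 estimate, which matters on a small benchmark where one problem can move the score by a noticeable amount.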