Mathematical Reasoning on AMC (Pass@1 Accuracy, Length Exceeding Ratio)
[Chart: Pass@1 Accuracy and Length Exceeding Ratio over time. Top Pass@1 Accuracy: 85.2 (GRPO, Jan 8, 2026).]
Evaluation Results
| Method | Model | Date | Pass@1 Accuracy | Length Exceeding Ratio |
|--------|-------|------|-----------------|------------------------|
| GRPO | Qwen3-4B-Instruct | 2026.01 | 85.2 | 0.7 |
| - | Qwen3-4B-Instruct | 2026.01 | 84.5 | 33.9 |
| GDPO | Qwen3-4B-Instruct | 2026.01 | 84.3 | 0.1 |
| GDPO | DeepSeek-R1-7B | 2026.01 | 84 | 0.3 |
| GRPO | DeepSeek-R1-7B | 2026.01 | 83.8 | 0.6 |
| - | DeepSeek-R1-7B | 2026.01 | 82.9 | 57.2 |
| GDPO | DeepSeek-R1-1.5B | 2026.01 | 69 | 2.3 |
| GRPO | DeepSeek-R1-1.5B | 2026.01 | 64.5 | 3.2 |
| - | DeepSeek-R1-1.5B | 2026.01 | 62 | 67.5 |