Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Mathematical Reasoning on AIME24 (Average Accuracy)
Loading...
23.5
Average Accuracy
GRPO
11.02
14.26
17.5
20.74
Dec 1, 2025
Average Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Average Accuracy
GRPO
Backbone=Qwen3-4B-Base...
2025.12
23.5
RePro
Backbone=Qwen3-4B-Base...
2025.12
21
Original
Backbone=Qwen3-4B-Base...
2025.12
11.5
Feedback
Search any
task
Search any
task