Mathematical Reasoning on AIME OpenR1-Math Harder Subset 2025 (Accuracy)
Best accuracy: 66.1 (RePO)
[Chart: accuracy over time; date shown: Feb 11, 2026]
Evaluation Results
Method    Settings                    Date      Accuracy
RePO      Backbone=Qwen, Paramet...   2026.02   66.1
Qwen-4B   Backbone=Qwen, Paramet...   2026.02   65.4
LUFFY     Backbone=Qwen, Paramet...   2026.02   46.4