Mathematical Reasoning on MATH-500 OpenR1 Harder
[Chart: accuracy over time; top result 96.6 (Qwen-4B), Feb 11, 2026]
Evaluation Results
Method    Details                    Date      Accuracy
Qwen-4B   Backbone=Qwen, Paramet...  2026.02   96.6
RePO      Backbone=Qwen, Paramet...  2026.02   96.6
LUFFY     Backbone=Qwen, Paramet...  2026.02   93.8