Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Mathematical Reasoning on AIME25 (Accuracy)
Loading...
76.3
Accuracy
LUSPO
0.692
20.321
39.95
59.579
Feb 5, 2026
Accuracy
Updated 3d ago
Evaluation Results
Method
Method
Links
Accuracy
LUSPO
Base model=Qwen3-30B-A...
2026.02
76.3
GSPO
Base model=Qwen3-30B-A...
2026.02
59.2
w/o RLVR
Base model=Qwen3-30B-A...
2026.02
57.2
LUSPO
Base model=Qwen2.5-7B-...
2026.02
13.9
GSPO
Base model=Qwen2.5-7B-...
2026.02
11.2
w/o RLVR
Base model=Qwen2.5-7B-...
2026.02
3.6
Feedback
Search any
task
Search any
task