Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Mathematical Reasoning on HMMT25 (Acc avg@32)
Loading...
82.2
Accuracy avg@32
IOP-GSPO
50.272
58.561
66.85
75.139
Apr 19, 2026
Accuracy avg@32
Updated 26d ago
Evaluation Results
Method
Method
Links
Accuracy avg@32
IOP-GSPO
Model Architecture=Qwe...
2026.04
82.2
GSPO
Model Architecture=Qwe...
2026.04
75.4
Base
Model Architecture=Qwe...
2026.04
73.9
IOP-GSPO
Model Architecture=Qwe...
2026.04
63.2
GSPO
Model Architecture=Qwe...
2026.04
55.4
Base
Model Architecture=Qwe...
2026.04
51.5
Feedback
Search any
task
Search any
task