Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Mathematical Proof on USAMO 2026
Loading...
99.4
Overall Score (%)
STAR-PólyaMath
44.9248
59.0674
73.21
87.3526
May 19, 2026
Overall Score (%)
Updated 14d ago
Evaluation Results
Method
Method
Links
Overall Score (%)
STAR-PólyaMath
Backbone=all GPT-5.5xh
2026.05
99.4
GPT-5.5
Reasoning Effort=xhigh
2026.05
98.21
GPT-5.4
Reasoning Effort=xhigh
2026.05
95.24
Gemini 3.1 Pro
2026.05
74.4
Claude Opus 4.7
Reasoning Effort=xhigh
2026.05
63.1
DeepSeek-v4-Pro
Reasoning Effort=Max
2026.05
60.71
Kimi K2.6
Reasoning Effort=Think
2026.05
51.19
GPT-5.2
Reasoning Effort=high
2026.05
50
Claude Opus 4.6
Reasoning Effort=high
2026.05
47.02
Feedback
Search any
task
Search any
task