Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Mathematical Proof on IMO 2025
Loading...
88.69
Score
STAR-PólyaMath
28.0268
43.7759
59.525
75.2741
May 19, 2026
Score
Updated 14d ago
Evaluation Results
Method
Method
Links
Score
STAR-PólyaMath
Backbone=all GPT-5.5xh
2026.05
88.69
GPT-5.4
Reasoning Effort=xhigh
2026.05
78.57
GPT-5.5
Reasoning Effort=xhigh
2026.05
70.83
Claude Opus 4.7
Reasoning Effort=xhigh
2026.05
70.24
Gemini 3.1 Pro
2026.05
67.26
GPT-5.2
Reasoning Effort=high
2026.05
58.33
Gemini 3 Pro
2026.05
41.67
Claude Opus 4.6
Reasoning Effort=high
2026.05
30.36
Feedback
Search any
task
Search any
task