Mathematical Proof on IMO-ProofBench
[Chart: Advanced Score, Basic Score, and Overall Score. Highlighted point: Aletheia, Advanced Score 91.9, Mar 19, 2026.]
Evaluation Results
| Method | Details | Date | Advanced Score | Basic Score | Overall Score |
|---|---|---|---|---|---|
| Aletheia | | 2026.03 | 91.9 | - | - |
| Gemini 3 Deep Think | | 2026.03 | 76.7 | - | - |
| Gemini Deep Think (IMO Gold) | | 2026.03 | 65.7 | 89 | 76.7 |
| DeepSeek-Math-V2-671B-A37B | | 2026.03 | 61.9 | 99 | 80.2 |
| DeepSeek-Math-V2-671B-A37B | Reproduced score=true,... | 2026.03 | 57.7 | 99.5 | 78.6 |
| Nemotron-Cascade-2-30B-A3B | Judge model=DeepSeek-V... | 2026.03 | 53.4 | 92.5 | 72.9 |
| GPT-5.2-Thinking (high) | | 2026.03 | 35.7 | - | - |
| Gemini 3 Pro | | 2026.03 | 30 | - | - |
| GPT-5 Pro | | 2026.03 | 28.6 | - | - |