Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Math Problem Solving on LiveMathBench
Loading...
100
AIME 24 Score
FoT (Round 2)
48
61.5
75
88.5
Apr 18, 2026
AIME 24 Score
AIME 25 Score
AMC Score
CCEE Score
CNMO Score
WLPMC Score
V202412 Hard Score
V202505 Hard Score
Overall Average Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
AIME 24 Score
AIME 25 Score
AMC Score
CCEE Score
CNMO Score
WLPMC Score
V202412 Hard Score
V202505 Hard Score
Overall Average Score
FoT (Round 2)
Base Model=Gemini-3-Pr...
2026.04
100
100
93.5
90.9
100
90.9
95.2
72
92.8
Isolated Agent
Base Model=Gemini-3-Pr...
2026.04
96.7
93.3
93.5
86.4
88.9
72.7
76.2
69
84.6
Isolated Agent
Base Model=DeepSeek-R1...
2026.04
50
40
67.4
84.1
72.2
27.3
52.4
36
53.7
FoT (Round 3)
Base Model=DeepSeek-R1...
2026.04
50
40
71.7
84.1
72.2
36.4
52.4
36
55.3
Feedback
Search any
task
Search any
task