Share your thoughts, 1 month free Claude Pro on usSee more

Mathematical Problem Solving on AIME 2025 (Pass@1)

95Pass@1

Gemini-3.0 Pro

Updated 3mo ago

Evaluation Results

Method	Links
Gemini-3.0 Pro 2025.12		95
GPT-5 High 2025.12		94.6
Kimi-K2 2025.12		94.5
DeepSeek-V3.2 2025.12		93.1
Claude-4.5-Sonnet 2025.12		87
MiniMax M2 2025.12		78.3
Qwen3-8B 2025.07		67.3
QUESTA-Nemotron-1.5B 2025.07		62.29
DeepSeek-R1-Distill-32B 2025.07		51.8
Nemotron-1.5B 2025.07		49.5
Qwen3-1.7B 2025.07		36.8
DeepSeek-R1-Distill-1.5B 2025.07		22.3