| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Mathematical Reasoning | AIMO-3 Leaderboard (public) | p-hat 69 | 5 |
| Mathematical Reasoning | AIMO 2025 (reference set) | Pass@120 | 4 |
| Mathematical Problem Solving | AIMO-3 2026 | Winner 46 | 1 |
| Mathematical Problem Solving | AIMO-1 2024 | Winner Score 29 | 1 |