| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| MATH | DeepSeek-R1 | Accuracy97.6 | 229 | 2mo ago | |
| AIME 2024 | Accuracy100 | 113 | 1mo ago | ||
| MATH500 | Self-MoA | Accuracy93 | 83 | 16d ago | |
| AIME 2025 | Score100 | 76 | 14d ago | ||
| MATH | Accuracy95.86 | 75 | 21d ago | ||
| AIME 25 | Accuracy93.3 | 71 | 1mo ago | ||
| Gaokao MathQA | Qwen2.5-Math-72B | Accuracy86.3 | 60 | 2mo ago | |
| LiveMath held-out (test) | SkillOpt | Score78.4 | 59 | 9d ago | |
| AIME 2024 | Self-MoA | Top-1 Accuracy76.67 | 54 | 16d ago | |
| AIME | MSV 64 | AIME Score1,289.8 | 52 | 3mo ago | |
| AIME 2025 | Top-1 Accuracy (%)91.67 | 46 | 23d ago | ||
| MATH | Accuracy95.7 | 40 | 1mo ago | ||
| AIME 25 | CoT | Average Time15.69 | 39 | 21d ago | |
| MATH | CoT | Average Time3.94 | 39 | 21d ago | |
| AIME 24 | CoT | Average Time15.58 | 39 | 21d ago | |
| Code2Math | Accuracy Ratio98 | 30 | 3mo ago | ||
| AIME | Accuracy85 | 28 | 23d ago | ||
| MinervaMath (test) | PASER | Accuracy21.2 | 28 | 3mo ago | |
| MathVerse (testmini) | Accuracy64.9 | 28 | 3mo ago | ||
| AMC | OWPO | Pass@178.36 | 27 | 12d ago | |
| MATH-Vision (test) | Accuracy68.8 | 26 | 3mo ago | ||
| MATH (test) | Gemini-Ultra | Accuracy53.2 | 25 | 21d ago | |
| MATH 500 | Accuracy95.66 | 24 | 14d ago | ||
| Olympiad | GCPO | Pass@k47.1 | 24 | 21d ago | |
| MINERVA | GCPO | Pass@k37.5 | 24 | 21d ago |