| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| AIME 2025 | SELF-THOUGHT | Acc@t180 | 24 | 1mo ago | |
| AIME 2024 | SELF-THOUGHT | Acc@t186.67 | 24 | 1mo ago | |
| Competition Math Average | ParaGator-Zero-4B | Pass@134 | 20 | 29d ago | |
| Olympiad | ParaGator-Zero-4B | Pass@154.89 | 20 | 29d ago | |
| AIME 2025 | Pass@127.37 | 20 | 29d ago | ||
| BrumoMath 2025 | ParaGator-Zero-4B | Pass@136.25 | 20 | 29d ago | |
| AIME 2025 | PAPO | Accuracy (avg@4)30.7 | 12 | 20d ago | |
| AIME 2024 | PAPO | Accuracy (avg@4)34.5 | 12 | 20d ago | |
| OlympiadBench | PAPO | Accuracy (avg@4)61.1 | 12 | 20d ago | |
| IMO (test) | PoT | Efficiency Ratio0.52 | 4 | 1mo ago |