| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Mathematical Reasoning | HMMT25 | Accuracy86.7 | 119 | |
| Mathematical Reasoning | HMMT25 | Accuracy (%)92.5 | 115 | |
| Mathematical Reasoning | HMMT25 | Pass@1653.3 | 24 | |
| Mathematical Reasoning | HMMT25 | Avg@12 Accuracy48.1 | 21 | |
| Mathematical Reasoning | HMMT25 | Accuracy (HMMT25)16.3 | 21 | |
| Math Reasoning | HMMT25 | Accuracy (HMMT25)34.9 | 21 | |
| Mathematical Reasoning | HMMT25 | Avg@3217.9 | 18 | |
| Math Reasoning | HMMT25 | Pass@866.7 | 14 | |
| Long-chain Mathematical Reasoning | HMMT25 | Accuracy avg@3282.2 | 6 | |
| Mathematics | HMMT25 | Throughput (Req/s)16.83 | 6 | |
| Mathematical Reasoning | HMMT25 | Pass@853.33 | 5 | |
| Math reasoning | HMMT25 Nov. | Mean@815.83 | 4 |