| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Mathematical Reasoning | AMC 23 | Accuracy100 | 198 | |
| Mathematical Reasoning | AMC | Accuracy97.5 | 151 | |
| Mathematical Reasoning | AMC | Pass@197.5 | 112 | |
| Math reasoning | AMC | Accuracy80 | 70 | |
| Mathematical Reasoning | AMC 2023 | Accuracy96.02 | 65 | |
| Mathematical Reasoning | AMC 23 | Pass@187.5 | 46 | |
| Mathematical Reasoning | AMC23 | Pass@192.5 | 43 | |
| Mathematical Reasoning | AMC23 | Avg@1689.22 | 36 | |
| Mathematical Reasoning | AMC23 (test) | Pass@192.2 | 36 | |
| Mathematical Reasoning | AMC 2023 | Accuracy90 | 32 | |
| Mathematical Reasoning | AMC 2023 | Pass@188.2 | 30 | |
| Mathematical Reasoning | AMC23 | Avg@1677.6 | 29 | |
| Mathematical Reasoning | AMC 23 | Acc93.3 | 28 | |
| Mathematical Reasoning | AMC 2023 | Avg@3286.75 | 27 | |
| Mathematical Reasoning | AMC 2023 (test) | Accuracy81.8 | 27 | |
| Math Reasoning | AMC 2023 | Accuracy90.5 | 26 | |
| Mathematical Reasoning | AMC 2023 | Accuracy100 | 26 | |
| Mathematical Reasoning | AMC23 | AVG@893.1 | 25 | |
| Mathematical Reasoning | AMC23 | Accuracy68.3 | 24 | |
| Mathematical Reasoning | AMC 23 | Accuracy90.5 | 24 | |
| Mathematical Reasoning | AMC | Latency (s)1.7 | 24 | |
| Mathematical Reasoning | AMC | C_mem (Ratio)0.1 | 24 | |
| Mathematical Reasoning | AMC 2023 | pass@891.9 | 23 | |
| Reasoning | AMC 2023 | Accuracy (AMC 2023)90.63 | 21 | |
| Mathematical Reasoning | AMC | Avg@3273.45 | 21 |