| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Mathematical Reasoning | CMATH | Accuracy 95.7 | 47 |
| Mathematical Reasoning | CMATH (test) | Accuracy 89.7 | 25 |
| Math | CMATH | Score 96.9 | 10 |
| Mathematical Reasoning | CMath | Pass@1 86 | 9 |
| Chinese-language ability | CMATH | Accuracy 84.8 | 6 |
| Mathematical Problem Solving | CMATH | Accuracy 71.7 | 4 |
| Chinese Mathematical Reasoning | CMath | CMath Score 40.5 | 1 |