| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Mathematical Reasoning | MathInstruct Scenario 1 | Accuracy68.4 | 53 | |
| Mathematical Reasoning | MathInstruct Scenario 4 | Accuracy82.6 | 8 | |
| Mathematical Reasoning | MathInstruct Scenario 3 | Accuracy83.2 | 8 | |
| Mathematical Reasoning | MathInstruct Scenario 2 | Accuracy83.8 | 8 |