| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Mathematical Reasoning | MAWPS | Accuracy 98.5 | 219 |
| Mathematical Reasoning | MAWPS (test) | Accuracy 96.5 | 87 |
| Mathematical Reasoning | MAWPS | Pass@1 97.6 | 28 |
| Mathematical Equation Generation | MAWPS (5-fold cross-validation) | Accuracy (5-fold) 92.5 | 23 |
| Math Word Problem Solving | MAWPS (5-fold cross-val) | Accuracy 92.3 | 21 |
| Arithmetic Reasoning | MAWPS | Accuracy 93.5 | 20 |
| Math Word Problem solving | MAWPS original (whole dataset) | Value Accuracy 91 | 14 |
| Math Word Problem solving | MAWPS English (test) | Accuracy 93 | 10 |
| Arithmetic Reasoning | MAWPS (5-fold cross val) | Accuracy 94.3 | 10 |
| Mathematical Reasoning | MAWPS Out-of-Distribution (test) | Accuracy 96.5 | 5 |