| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Mathematical Reasoning | MetaMathQA | Score 89.57 | 54 |
| Instruction Fine-tuning | MetaMathQA Fine-tuning Evaluation Suite (ARC-C, PIQA, MMLU, HE, GSM8K) (test) | ARC-C Accuracy 51.45 | 32 |
| Mathematical Reasoning | MetaMathQA (test) | Accuracy 53.37 | 26 |
| Mathematical Question Answering | MetaMathQA (final) | Accuracy 92.52 | 9 |
| Mathematical Reasoning | MetaMathQA 100k-samples | Loss 0.142 | 5 |