Learning Multi-Step Reasoning by Solving Arithmetic Tasks
About
Mathematical reasoning is regarded as a necessary ability for Language Models (LMs). Recent works demonstrate large LMs' impressive performance in solving math problems. The success is attributed to their Chain-of-Thought (CoT) reasoning abilities, i.e., the ability to decompose complex questions into step-by-step reasoning chains, but such ability seems only to emerge from models with abundant parameters. This work investigates how to incorporate relatively small LMs with the capabilities of multi-step reasoning. We propose to inject such abilities by continually pre-training LMs on a synthetic dataset MsAT which is composed of Multi-step Arithmetic Tasks. Our experiments on four math word problem datasets show the effectiveness of the proposed method in enhancing LMs' math reasoning abilities.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Mathematical Reasoning | SVAMP (test) | Accuracy48.9 | 233 | |
| Arithmetic Reasoning | MAWPS (5-fold cross val) | Accuracy94.3 | 10 | |
| Math Word Problem Solving | ASDiv-A (5-fold cross-val) | Accuracy87.5 | 7 | |
| Math Word Problem Solving | SVAMP hard (test) | Accuracy48.2 | 6 |