Take a Step Back: Evoking Reasoning via Abstraction in Large Language Models
About
We present Step-Back Prompting, a simple prompting technique that enables LLMs to do abstractions to derive high-level concepts and first principles from instances containing specific details. Using the concepts and principles to guide reasoning, LLMs significantly improve their abilities in following a correct reasoning path towards the solution. We conduct experiments of Step-Back Prompting with PaLM-2L, GPT-4 and Llama2-70B models, and observe substantial performance gains on various challenging reasoning-intensive tasks including STEM, Knowledge QA, and Multi-Hop Reasoning. For instance, Step-Back Prompting improves PaLM-2L performance on MMLU (Physics and Chemistry) by 7% and 11% respectively, TimeQA by 27%, and MuSiQue by 7%.
Huaixiu Steven Zheng, Swaroop Mishra, Xinyun Chen, Heng-Tze Cheng, Ed H. Chi, Quoc V Le, Denny Zhou• 2023
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Question Answering | OpenBookQA | Accuracy93.8 | 305 | |
| Multitask Language Understanding | MMLU-Pro | Accuracy45.5 | 248 | |
| Mathematical Reasoning | MATH 500 | Accuracy26 | 221 | |
| Medical Question Answering | MedQA | Accuracy68.42 | 154 | |
| Mathematical Reasoning | AGIEval MATH | Accuracy47.5 | 99 | |
| Question Answering | GPQA (test) | Accuracy42.4 | 65 | |
| Mathematical Problem Solving | Gaokao MathQA | Accuracy64.5 | 60 | |
| Math Reasoning | MATH (test) | Accuracy90.4 | 59 | |
| Multistep Reasoning | MuSR | Accuracy70 | 53 | |
| Mathematical Reasoning | Minerva (test) | Acc38.6 | 46 |
Showing 10 of 50 rows