Take a Step Back: Evoking Reasoning via Abstraction in Large Language Models

About

We present Step-Back Prompting, a simple prompting technique that enables LLMs to do abstractions to derive high-level concepts and first principles from instances containing specific details. Using the concepts and principles to guide reasoning, LLMs significantly improve their abilities in following a correct reasoning path towards the solution. We conduct experiments of Step-Back Prompting with PaLM-2L, GPT-4 and Llama2-70B models, and observe substantial performance gains on various challenging reasoning-intensive tasks including STEM, Knowledge QA, and Multi-Hop Reasoning. For instance, Step-Back Prompting improves PaLM-2L performance on MMLU (Physics and Chemistry) by 7% and 11% respectively, TimeQA by 27%, and MuSiQue by 7%.

Huaixiu Steven Zheng, Swaroop Mishra, Xinyun Chen, Heng-Tze Cheng, Ed H. Chi, Quoc V Le, Denny Zhou• 2023

Related benchmarks

Task	Dataset	Result
Question Answering	OpenBookQA	Accuracy93.8	305
Multitask Language Understanding	MMLU-Pro	Accuracy45.5	248
Mathematical Reasoning	MATH 500	Accuracy26	221
Medical Question Answering	MedQA	Accuracy68.42	154
Mathematical Reasoning	AGIEval MATH	Accuracy47.5	99
Question Answering	GPQA (test)	Accuracy42.4	65
Mathematical Problem Solving	Gaokao MathQA	Accuracy64.5	60
Math Reasoning	MATH (test)	Accuracy90.4	59
Multistep Reasoning	MuSR	Accuracy70	53
Mathematical Reasoning	Minerva (test)	Acc38.6	46

Showing 10 of 50 rows

Other info

Follow for update

@wizwand_team Discord