Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Reasoning Like Program Executors

About

Reasoning over natural language is a long-standing goal for the research community. However, studies have shown that existing language models are inadequate in reasoning. To address the issue, we present POET, a novel reasoning pre-training paradigm. Through pre-training language models with programs and their execution results, POET empowers language models to harvest the reasoning knowledge possessed by program executors via a data-driven approach. POET is conceptually simple and can be instantiated by different kinds of program executors. In this paper, we showcase two simple instances POET-Math and POET-Logic, in addition to a complex instance, POET-SQL. Experimental results on six benchmarks demonstrate that POET can significantly boost model performance in natural language reasoning, such as numerical reasoning, logical reasoning, and multi-hop reasoning. POET opens a new gate on reasoning-enhancement pre-training, and we hope our analysis would shed light on the future research of reasoning like program executors.

Xinyu Pi, Qian Liu, Bei Chen, Morteza Ziyadi, Zeqi Lin, Qiang Fu, Yan Gao, Jian-Guang Lou, Weizhu Chen• 2022

Related benchmarks

TaskDatasetResultRank
Mathematical ReasoningSVAMP (test)
Accuracy57.4
233
Arithmetic ReasoningSVAMP
Accuracy57.4
48
Question AnsweringHotpotQA (test)
EM0.687
12
Question AnsweringDROP (dev)
EM78
10
Natural Language InferenceEQUATE (test)
Exact Match67.5
5
Numerical ReasoningDROP span-subset (dev)
EM79.8
4
Numerical ReasoningHotpotQA (test)
EM68.7
4
Numerical ReasoningTAT-QA (dev)
Exact Match (EM)59.1
4
Numerical ReasoningEQUATE (test)
Exact Match67.5
4
Numerical ReasoningSVAMP (test)
Exact Match (EM)57.4
4
Showing 10 of 12 rows

Other info

Code

Follow for update