A Simple Method for Commonsense Reasoning

About

Commonsense reasoning is a long-standing challenge for deep learning. For example, it is difficult to use neural networks to tackle the Winograd Schema dataset (Levesque et al., 2011). In this paper, we present a simple method for commonsense reasoning with neural networks, using unsupervised learning. Key to our method is the use of language models, trained on a massive amount of unlabled data, to score multiple choice questions posed by commonsense reasoning tests. On both Pronoun Disambiguation and Winograd Schema challenges, our models outperform previous state-of-the-art methods by a large margin, without using expensive annotated knowledge bases or hand-engineered features. We train an array of large RNN language models that operate at word or character level on LM-1-Billion, CommonCrawl, SQuAD, Gutenberg Books, and a customized corpus for this task and show that diversity of training data plays an important role in test performance. Further analysis also shows that our system successfully discovers important features of the context that decide the correct answer, indicating a good grasp of commonsense knowledge.

Trieu H. Trinh, Quoc V. Le• 2018

Related benchmarks

Task	Dataset	Result
Pronoun Resolution	WinoGrande	Accuracy50.9	58
Coreference Resolution	GAP (test)	Overall F176	53
Coreference Resolution	Winograd WSC273 (test)	Accuracy70	34
Pronoun Disambiguation	Winograd Schema Challenge	Accuracy63.7	27
Pronoun Disambiguation Problem	PDP 2016 (test)	Accuracy70	21
Commonsense Reasoning	Winograd Schema Challenge (WSC) (test)	Accuracy61.5	17
Pronoun Disambiguation	WSC (test)	--	14
Pronoun Disambiguation	PDP-60	Accuracy66.7	12
Coreference Resolution	Winogender (WG) (test)	Accuracy78.8	11
Pronoun Disambiguation Problem	PDP60 (test)	Accuracy70	9

Showing 10 of 13 rows

Other info

Code

Follow for update

@wizwand_team Discord