How Much Knowledge Can You Pack Into the Parameters of a Language Model?

About

It has recently been observed that neural language models trained on unstructured text can implicitly store and retrieve knowledge using natural language queries. In this short paper, we measure the practical utility of this approach by fine-tuning pre-trained models to answer questions without access to any external context or knowledge. We show that this approach scales with model size and performs competitively with open-domain systems that explicitly retrieve answers from an external knowledge source when answering questions. To facilitate reproducibility and future work, we release our code and trained models at https://goo.gle/t5-cbqa.

Adam Roberts, Colin Raffel, Noam Shazeer• 2020

Related benchmarks

Task	Dataset	Result
Commonsense Reasoning	PIQA	Accuracy68.48	757
Common Sense Reasoning	COPA	Accuracy72.1	256
Question Answering	2Wiki	--	241
Question Answering	TriviaQA	Accuracy29.1	238
Commonsense Reasoning	OBQA	Accuracy58.6	187
Commonsense Reasoning	SocialIQA	Accuracy65.5	158
Open Question Answering	Natural Questions (NQ) (test)	Exact Match (EM)36.6	134
Question Answering	NQ	Accuracy26.3	123
Question Answering	PopQA	Accuracy20.73	103
Open-domain Question Answering	TriviaQA	EM42.3	88

Showing 10 of 54 rows

Other info

Follow for update

@wizwand_team Discord