
Atlas: Few-shot Learning with Retrieval Augmented Language Models

About

Large language models have shown impressive few-shot results on a wide range of tasks. However, when knowledge is key to such results, as it is for tasks such as question answering and fact checking, massive parameter counts seem to be needed to store that knowledge. Retrieval-augmented models are known to excel at knowledge-intensive tasks without needing as many parameters, but it is unclear whether they work in few-shot settings. In this work we present Atlas, a carefully designed and pre-trained retrieval-augmented language model able to learn knowledge-intensive tasks with very few training examples. We perform evaluations on a wide range of tasks, including MMLU, KILT and NaturalQuestions, and study the impact of the content of the document index, showing that it can easily be updated. Notably, Atlas reaches over 42% accuracy on Natural Questions using only 64 examples, outperforming a 540B-parameter model by 3% despite having 50x fewer parameters.
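The core loop the abstract describes, retrieve passages for a query, then condition generation on each (query, passage) pair, can be sketched as follows. This is a minimal illustration, not Atlas itself: the toy term-overlap retriever and the prompt format are assumptions; Atlas uses a dense retriever (Contriever) and a T5 reader with Fusion-in-Decoder.

```python
import re

def tokenize(text):
    # Lowercase word tokens; a stand-in for a real dense encoder.
    return set(re.findall(r"\w+", text.lower()))

def score(query, passage):
    # Toy relevance: fraction of query terms appearing in the passage.
    q, p = tokenize(query), tokenize(passage)
    return len(q & p) / (len(q) or 1)

def retrieve(query, corpus, k=2):
    # Return the top-k passages by the toy relevance score.
    return sorted(corpus, key=lambda d: score(query, d), reverse=True)[:k]

corpus = [
    "Paris is the capital of France.",
    "The Eiffel Tower is in Paris.",
    "Berlin is the capital of Germany.",
]

query = "What is the capital of France?"
top = retrieve(query, corpus, k=2)

# In a Fusion-in-Decoder reader, each (query, passage) pair is encoded
# independently and the decoder attends over all of them jointly.
inputs = [f"question: {query} context: {p}" for p in top]
```

Because only the index changes when knowledge changes, swapping `corpus` for an updated document collection updates the model's knowledge without retraining, which is the property the abstract highlights.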

Gautier Izacard, Patrick Lewis, Maria Lomeli, Lucas Hosseini, Fabio Petroni, Timo Schick, Jane Dwivedi-Yu, Armand Joulin, Sebastian Riedel, Edouard Grave • 2022

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
| --- | --- | --- | --- | --- |
| Multitask Language Understanding | MMLU (test) | Accuracy | 66 | 303 |
| Multi-hop Question Answering | 2WikiMultihopQA | EM | 38.1 | 278 |
| Multi-hop Question Answering | HotpotQA | F1 Score | 55.3 | 221 |
| Question Answering | TriviaQA | -- | -- | 85 |
| Fact Verification | FEVER | Accuracy | 0.77 | 67 |
| Multi-hop Question Answering | Multi-hop RAG | F1 | 52.3 | 65 |
| Question Answering | NQ (Natural Questions) | EM | 26.7 | 55 |
| Open-domain Question Answering | NaturalQ-Open (test) | EM | 60.4 | 37 |
| Question Answering | Natural Questions (NQ) (test) | Exact Match | 60.4 | 35 |
| Fact Verification | FEVER (test) | -- | -- | 32 |

Showing 10 of 26 rows.
