Leveraging Passage Retrieval with Generative Models for Open Domain Question Answering

About

Generative models for open domain question answering have proven to be competitive, without resorting to external knowledge. While promising, this approach requires to use models with billions of parameters, which are expensive to train and query. In this paper, we investigate how much these models can benefit from retrieving text passages, potentially containing evidence. We obtain state-of-the-art results on the Natural Questions and TriviaQA open benchmarks. Interestingly, we observe that the performance of this method significantly improves when increasing the number of retrieved passages. This is evidence that generative models are good at aggregating and combining evidence from multiple passages.

Gautier Izacard, Edouard Grave• 2020

Related benchmarks

Task	Dataset	Result
Open Question Answering	Natural Questions (NQ) (test)	Exact Match (EM)51.4	134
Question Answering	NQ (test)	EM Accuracy51.4	133
Open-domain Question Answering	TriviaQA	EM67.6	88
Question Answering	NarrativeQA (test)	ROUGE-L30.8	88
Open-domain Question Answering	Natural Questions (NQ)	Exact Match (EM)51.4	82
Open-domain Question Answering	TriviaQA (test)	Exact Match68.7	80
Question Answering	Natural Questions (NQ) (test)	Exact Match51.5	77
Question Answering	Natural Questions (test)	EM51.4	72
End-to-end Open-Domain Question Answering	NQ (test)	Exact Match (EM)51.9	59
Open-domain Question Answering	TriviaQA open (test)	EM67.6	59

Showing 10 of 63 rows

Other info

Code

Follow for update

@wizwand_team Discord