Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Generator-Retriever-Generator Approach for Open-Domain Question Answering

About

Open-domain question answering (QA) tasks usually require the retrieval of relevant information from a large corpus to generate accurate answers. We propose a novel approach called Generator-Retriever-Generator (GRG) that combines document retrieval techniques with a large language model (LLM), by first prompting the model to generate contextual documents based on a given question. In parallel, a dual-encoder network retrieves documents that are relevant to the question from an external corpus. The generated and retrieved documents are then passed to the second LLM, which generates the final answer. By combining document retrieval and LLM generation, our approach addresses the challenges of open-domain QA, such as generating informative and contextually relevant answers. GRG outperforms the state-of-the-art generate-then-read and retrieve-then-read pipelines (GENREAD and RFiD) improving their performance by at least by +5.2, +4.2, and +1.6 on TriviaQA, NQ, and WebQ datasets, respectively. We provide code, datasets, and checkpoints at https://github.com/abdoelsayed2016/GRG.

Abdelrahman Abdallah, Adam Jatowt• 2023

Related benchmarks

TaskDatasetResultRank
Multi-hop Question AnsweringHotpotQA (test)
F150.21
198
Multi-hop Question Answering2WikiMultiHopQA (test)--
143
Multi-hop Question AnsweringMuSiQue (test)
F124.87
111
Multi-hop Question AnsweringStrategyQA (test)
Accuracy75.21
26
Question Answering CorrectnessHuman Evaluation (120 randomly sampled cases from HotpotQA, 2WikiMultiHopQA, MuSiQue, and DuReader)
Accuracy49.71
4
Showing 5 of 5 rows

Other info

Follow for update