Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Learning to Filter Context for Retrieval-Augmented Generation

About

On-the-fly retrieval of relevant knowledge has proven an essential element of reliable systems for tasks such as open-domain question answering and fact verification. However, because retrieval systems are not perfect, generation models are required to generate outputs given partially or entirely irrelevant passages. This can cause over- or under-reliance on context, and result in problems in the generated output such as hallucinations. To alleviate these problems, we propose FILCO, a method that improves the quality of the context provided to the generator by (1) identifying useful context based on lexical and information-theoretic approaches, and (2) training context filtering models that can filter retrieved contexts at test time. We experiment on six knowledge-intensive tasks with FLAN-T5 and LLaMa2, and demonstrate that our method outperforms existing approaches on extractive question answering (QA), complex multi-hop and long-form QA, fact verification, and dialog generation tasks. FILCO effectively improves the quality of context, whether or not it supports the canonical output.

Zhiruo Wang, Jun Araki, Zhengbao Jiang, Md Rizwan Parvez, Graham Neubig• 2023

Related benchmarks

TaskDatasetResultRank
Multi-hop Question AnsweringHotpotQA (test)
F158.15
198
Open Question AnsweringNatural Questions (NQ) (test)
Exact Match (EM)27.3
134
Open-domain Question AnsweringNatural Questions (test)
EM44.79
18
Open-domain QATriviaQA (TQA) (test)
EM60.4
10
Open-domain QAHotpotQA (HQA) (test)
Exact Match0.239
10
Multi-hop Question AnsweringMuSiQue (OOD)
EM8.36
6
Multi-hop Question Answering2WikiQA OOD evaluation
EM27.5
6
Showing 7 of 7 rows

Other info

Follow for update