Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

REPLUG: Retrieval-Augmented Black-Box Language Models

About

We introduce REPLUG, a retrieval-augmented language modeling framework that treats the language model (LM) as a black box and augments it with a tuneable retrieval model. Unlike prior retrieval-augmented LMs that train language models with special cross attention mechanisms to encode the retrieved text, REPLUG simply prepends retrieved documents to the input for the frozen black-box LM. This simple design can be easily applied to any existing retrieval and language models. Furthermore, we show that the LM can be used to supervise the retrieval model, which can then find documents that help the LM make better predictions. Our experiments demonstrate that REPLUG with the tuned retriever significantly improves the performance of GPT-3 (175B) on language modeling by 6.3%, as well as the performance of Codex on five-shot MMLU by 5.1%.

Weijia Shi, Sewon Min, Michihiro Yasunaga, Minjoon Seo, Rich James, Mike Lewis, Luke Zettlemoyer, Wen-tau Yih• 2023

Related benchmarks

TaskDatasetResultRank
Multi-hop Question Answering2WikiMultihopQA
EM40.23
559
Multitask Language UnderstandingMMLU (test)
Accuracy71.8
312
Multi-hop Question AnsweringHotpotQA--
294
Multi-hop Question AnsweringMuSiQue
EM18.9
209
Multi-hop Question AnsweringHotpotQA
Exact Match (EM)42.63
150
Multi-hop QAHotpotQA
Exact Match32.8
143
Open Question AnsweringNatural Questions (NQ) (test)
Exact Match (EM)29.4
134
Question AnsweringNQ (test)--
133
Multi-hop Question AnsweringBamboogle
Exact Match41.6
128
Information RetrievalBEIR
SciFact0.737
120
Showing 10 of 55 rows

Other info

Follow for update