REPLUG: Retrieval-Augmented Black-Box Language Models

About

We introduce REPLUG, a retrieval-augmented language modeling framework that treats the language model (LM) as a black box and augments it with a tuneable retrieval model. Unlike prior retrieval-augmented LMs that train language models with special cross attention mechanisms to encode the retrieved text, REPLUG simply prepends retrieved documents to the input for the frozen black-box LM. This simple design can be easily applied to any existing retrieval and language models. Furthermore, we show that the LM can be used to supervise the retrieval model, which can then find documents that help the LM make better predictions. Our experiments demonstrate that REPLUG with the tuned retriever significantly improves the performance of GPT-3 (175B) on language modeling by 6.3%, as well as the performance of Codex on five-shot MMLU by 5.1%.

Weijia Shi, Sewon Min, Michihiro Yasunaga, Minjoon Seo, Rich James, Mike Lewis, Luke Zettlemoyer, Wen-tau Yih• 2023

Related benchmarks

Task	Dataset	Result
Multi-hop Question Answering	2WikiMultihopQA	EM40.23	559
Multitask Language Understanding	MMLU (test)	Accuracy71.8	312
Multi-hop Question Answering	HotpotQA	--	294
Multi-hop Question Answering	MuSiQue	EM18.9	209
Multi-hop Question Answering	HotpotQA	Exact Match (EM)42.63	150
Multi-hop QA	HotpotQA	Exact Match32.8	143
Open Question Answering	Natural Questions (NQ) (test)	Exact Match (EM)29.4	134
Question Answering	NQ (test)	--	133
Multi-hop Question Answering	Bamboogle	Exact Match41.6	128
Information Retrieval	BEIR	SciFact0.737	120

Showing 10 of 55 rows

Other info

Follow for update

@wizwand_team Discord