
Learning to Retrieve In-Context Examples for Large Language Models

About

Large language models (LLMs) have demonstrated their ability to learn in-context, allowing them to perform various tasks based on a few input-output examples. However, the effectiveness of in-context learning is heavily reliant on the quality of the selected examples. In this paper, we propose a novel framework to iteratively train dense retrievers that can identify high-quality in-context examples for LLMs. Our framework initially trains a reward model based on LLM feedback to evaluate the quality of candidate examples, followed by knowledge distillation to train a bi-encoder based dense retriever. Our experiments on a suite of 30 tasks demonstrate that our framework significantly enhances in-context learning performance. Furthermore, we show the generalization ability of our framework to unseen tasks during training. An in-depth analysis reveals that our model improves performance by retrieving examples with similar patterns, and the gains are consistent across LLMs of varying sizes. The code and data are available at https://github.com/microsoft/LMOps/tree/main/llm_retriever.
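The training recipe the abstract describes — score candidate examples with a reward model trained on LLM feedback, then distill those scores into a bi-encoder dense retriever — can be sketched in a few lines. The sketch below is illustrative only: the function names, the dot-product similarity, and the KL objective over softmax-normalized scores are assumptions, not the paper's exact implementation.

```python
import numpy as np

def softmax(scores, temp=1.0):
    """Numerically stable softmax over a 1-D array of scores."""
    z = np.asarray(scores, dtype=float) / temp
    z = z - z.max()
    e = np.exp(z)
    return e / e.sum()

def distill_kl_loss(retriever_scores, reward_scores, temp=1.0):
    """KL(teacher || student): the reward model's distribution over
    candidate examples is the teacher; the bi-encoder's similarity
    distribution is the student being trained."""
    p = softmax(reward_scores, temp)       # teacher (reward model)
    q = softmax(retriever_scores, temp)    # student (dense retriever)
    return float(np.sum(p * (np.log(p + 1e-12) - np.log(q + 1e-12))))

# Toy usage: score candidates by dot product of query/candidate embeddings.
rng = np.random.default_rng(0)
query_emb = rng.normal(size=8)
cand_embs = rng.normal(size=(4, 8))        # 4 candidate in-context examples
retriever_scores = cand_embs @ query_emb   # bi-encoder similarities
reward_scores = rng.normal(size=4)         # stand-in for LLM-feedback rewards
loss = distill_kl_loss(retriever_scores, reward_scores)
```

Minimizing this loss pushes the retriever's ranking of candidate examples toward the reward model's ranking; the loss is zero exactly when the two distributions agree.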

Liang Wang, Nan Yang, Furu Wei • 2023

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
| --- | --- | --- | --- | --- |
| Commonsense Reasoning | HellaSwag | Accuracy | 74.6 | 1460 |
| Natural Language Inference | RTE | Accuracy | 61.7 | 367 |
| Reading Comprehension | BoolQ | Accuracy | 74.9 | 219 |
| Natural Language Inference | SNLI | Accuracy | 80.0 | 174 |
| Topic Classification | AG-News | Accuracy | 92.4 | 173 |
| Sentiment Analysis | SST-2 | Accuracy | 93.4 | 156 |
| Commonsense Reasoning | COPA | Accuracy | 85.0 | 138 |
| Sentiment Analysis | SST-2 (test) | Accuracy | 94.3 | 136 |
| Natural Language Generation | E2E (test) | ROUGE-L | 56.4 | 79 |
| Paraphrase Identification | QQP | Accuracy | 80.9 | 78 |

(10 of 25 benchmark rows shown.)
