DeepNote: Note-Centric Deep Retrieval-Augmented Generation

About

Retrieval-Augmented Generation (RAG) mitigates factual errors and hallucinations in Large Language Models (LLMs) for question-answering (QA) by incorporating external knowledge. However, existing adaptive RAG methods rely on LLMs to predict retrieval timing and directly use retrieved information for generation, often failing to reflect real information needs and fully leverage retrieved knowledge. We develop DeepNote, an adaptive RAG framework that achieves in-depth and robust exploration of knowledge sources through note-centric adaptive retrieval. DeepNote employs notes as carriers for refining and accumulating knowledge. During in-depth exploration, it uses these notes to determine retrieval timing, formulate retrieval queries, and iteratively assess knowledge growth, ultimately leveraging the best note for answer generation. Extensive experiments and analyses demonstrate that DeepNote significantly outperforms all baselines (+10.2% to +20.1%) and exhibits the ability to gather knowledge with both high density and quality. Additionally, DPO further improves the performance of DeepNote. The code and data are available at https://github.com/thunlp/DeepNote.

Ruobing Wang, Qingfei Zhao, Yukun Yan, Daren Zha, Yuxuan Chen, Shi Yu, Zhenghao Liu, Yixuan Wang, Shuo Wang, Xu Han, Zhiyuan Liu, Maosong Sun• 2024

Related benchmarks

Task	Dataset	Result
Multi-hop Question Answering	HotpotQA (test)	F158.4	311
Multi-hop Question Answering	HotpotQA	F1 Score46.6	294
Multi-hop Question Answering	2WikiMultiHopQA (test)	EM43.2	226
Multi-hop Question Answering	2WikiMQA	F1 Score64.4	161
Single-hop Question Answering	TriviaQA	--	133
Multi-hop Question Answering	MuSiQue (test)	F124.2	128
Multi-hop Question Answering	HotpotQA	F159.97	79
Question Answering	2WikiMQA	--	66
Multi-hop Question Answering	Bamboogle	EM27.2	51
Multi-hop Question Answering	MuSiQue	F130.9	38

Showing 10 of 30 rows

Other info

Follow for update

@wizwand_team Discord