
Retrieval-Augmented Generation for Large Language Models: A Survey

About

Large Language Models (LLMs) showcase impressive capabilities but encounter challenges like hallucination, outdated knowledge, and non-transparent, untraceable reasoning processes. Retrieval-Augmented Generation (RAG) has emerged as a promising solution by incorporating knowledge from external databases. This enhances the accuracy and credibility of the generation, particularly for knowledge-intensive tasks, and allows for continuous knowledge updates and integration of domain-specific information. RAG synergistically merges LLMs' intrinsic knowledge with the vast, dynamic repositories of external databases. This comprehensive review paper offers a detailed examination of the progression of RAG paradigms, encompassing Naive RAG, Advanced RAG, and Modular RAG. It scrutinizes the tripartite foundation of RAG frameworks: retrieval, generation, and augmentation techniques. The paper highlights the state-of-the-art technologies embedded in each of these components, providing a deeper understanding of the advancements in RAG systems. Furthermore, the paper introduces an up-to-date evaluation framework and benchmark. Finally, it delineates the challenges currently faced and points out prospective avenues for research and development.
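To make the retrieve-then-generate idea in the abstract concrete, here is a minimal sketch of a naive pipeline. It is an illustration under stated assumptions, not the survey's method: the retriever is a toy bag-of-words cosine ranker, the corpus is invented, and `generate` is a hypothetical placeholder standing in for a real LLM call.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors (Counters)."""
    dot = sum(count * b[term] for term, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, documents, k=2):
    """Retrieval step: return the k documents most similar to the query."""
    q = Counter(tokenize(query))
    ranked = sorted(
        documents,
        key=lambda d: cosine(q, Counter(tokenize(d))),
        reverse=True,
    )
    return ranked[:k]


def build_prompt(query, passages):
    """Augmentation step: place the retrieved passages ahead of the question."""
    context = "\n".join(f"- {p}" for p in passages)
    return f"Context:\n{context}\n\nQuestion: {query}\nAnswer:"


def generate(prompt):
    """Generation step. Hypothetical placeholder: a real system would call an LLM here."""
    return f"[model output for a prompt of {len(prompt)} characters]"


# Toy corpus, invented for illustration only.
corpus = [
    "The Eiffel Tower is in Paris.",
    "Mount Everest is the highest mountain on Earth.",
    "Python is a programming language.",
]

question = "Where is the Eiffel Tower?"
passages = retrieve(question, corpus, k=1)
answer = generate(build_prompt(question, passages))
```

In practice the bag-of-words ranker would typically be replaced by a dense embedding index, and the prompt template would be tuned to the model in use; both choices are outside what this page specifies.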

Yunfan Gao, Yun Xiong, Xinyu Gao, Kangxiang Jia, Jinliu Pan, Yuxi Bi, Yi Dai, Jiawei Sun, Meng Wang, Haofen Wang • 2023

Related benchmarks

| Task | Dataset | Result | Rank |
| --- | --- | --- | --- |
| Multi-hop Question Answering | 2WikiMultihopQA | -- | 387 |
| Multi-hop Question Answering | HotpotQA | F1 Score 73.07 | 294 |
| Question Answering | 2Wiki | F1 24.27 | 152 |
| Question Answering | PopQA | EM 37.6 | 88 |
| Question Answering | 2WikiMultiHopQA (test) | F1 24.4 | 81 |
| Question Answering | Natural Questions (NQ) (test) | -- | 68 |
| Graph Reasoning | GRBENCH Legal | QwenScore 34.4 | 32 |
| Graph Reasoning | GRBENCH Academic | QwenScore 0.146 | 32 |
| Graph Reasoning | GRBENCH E-Commerce | QwenScore 0.25 | 32 |
| Graph Reasoning | GRBENCH Literature | QwenScore 19.6 | 32 |
Showing 10 of 45 rows
