Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation

About

Large Language Models (LLMs) exhibit remarkable capabilities but are prone to generating inaccurate or hallucinatory responses. This limitation stems from their reliance on vast pretraining datasets, making them susceptible to errors in unseen scenarios. To tackle these challenges, Retrieval-Augmented Generation (RAG) addresses this by incorporating external, relevant documents into the response generation process, thus leveraging non-parametric knowledge alongside LLMs' in-context learning abilities. However, existing RAG implementations primarily focus on initial input for context retrieval, overlooking the nuances of ambiguous or complex queries that necessitate further clarification or decomposition for accurate responses. To this end, we propose learning to Refine Query for Retrieval Augmented Generation (RQ-RAG) in this paper, endeavoring to enhance the model by equipping it with capabilities for explicit rewriting, decomposition, and disambiguation. Our experimental results indicate that our method, when applied to a 7B Llama2 model, surpasses the previous state-of-the-art (SOTA) by an average of 1.9\% across three single-hop QA datasets, and also demonstrates enhanced performance in handling complex, multi-hop QA datasets. Our code is available at https://github.com/chanchimin/RQ-RAG.

Chi-Min Chan, Chunpu Xu, Ruibin Yuan, Hongyin Luo, Wei Xue, Yike Guo, Jie Fu• 2024

Related benchmarks

TaskDatasetResultRank
Question AnsweringARC Challenge
Accuracy68.9
749
Question AnsweringOBQA
Accuracy83.5
276
Multi-hop Question AnsweringHotpotQA
F1 Score61.9
221
Multi-hop Question AnsweringHotpotQA (test)
F136.3
198
Question AnsweringPopQA
Accuracy64.2
186
Multi-hop Question Answering2WikiMultiHopQA (test)
EM26.8
143
Question Answering2Wiki
F157.5
75
Question AnsweringARC-C
Accuracy0.68
68
Multi-hop Question AnsweringBamboogle (test)
EM24.8
46
Multi-hop Question Answering2Wiki
F1 Score44.3
41
Showing 10 of 33 rows

Other info

Follow for update