Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Removal of Hallucination on Hallucination: Debate-Augmented RAG

About

Retrieval-Augmented Generation (RAG) enhances factual accuracy by integrating external knowledge, yet it introduces a critical issue: erroneous or biased retrieval can mislead generation, compounding hallucinations, a phenomenon we term Hallucination on Hallucination. To address this, we propose Debate-Augmented RAG (DRAG), a training-free framework that integrates Multi-Agent Debate (MAD) mechanisms into both retrieval and generation stages. In retrieval, DRAG employs structured debates among proponents, opponents, and judges to refine retrieval quality and ensure factual reliability. In generation, DRAG introduces asymmetric information roles and adversarial debates, enhancing reasoning robustness and mitigating factual inconsistencies. Evaluations across multiple tasks demonstrate that DRAG improves retrieval reliability, reduces RAG-induced hallucinations, and significantly enhances overall factual accuracy. Our code is available at https://github.com/Huenao/Debate-Augmented-RAG.

Wentao Hu, Wengyu Zhang, Yiyang Jiang, Chen Jason Zhang, Xiaoyong Wei, Qing Li• 2025

Related benchmarks

TaskDatasetResultRank
Question Answering2Wiki
F136.97
152
Multi-hop Question Answering2Wiki
Exact Match28.8
152
Question AnsweringPopQA
EM38.6
88
Multi-hop Question AnsweringMulti-hop RAG
F130.2
77
Question AnsweringNQ
EM36.8
69
RetrievalHotpotQA
R@588.3
36
RetrievalPopQA
R@561.8
19
Retrieval2Wiki
Recall@584.4
19
RetrievalNQ
R@564.4
19
Question AnsweringStrategyQA
Exact Match (EM)69.2
16
Showing 10 of 14 rows

Other info

Follow for update