Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

FaithfulRAG: Fact-Level Conflict Modeling for Context-Faithful Retrieval-Augmented Generation

About

Large language models (LLMs) augmented with retrieval systems have demonstrated significant potential in handling knowledge-intensive tasks. However, these models often struggle with unfaithfulness issues, generating outputs that either ignore the retrieved context or inconsistently blend it with the LLM`s parametric knowledge. This issue is particularly severe in cases of knowledge conflict, where the retrieved context conflicts with the model`s parametric knowledge. While existing faithful RAG approaches enforce strict context adherence through well-designed prompts or modified decoding strategies, our analysis reveals a critical limitation: they achieve faithfulness by forcibly suppressing the model`s parametric knowledge, which undermines the model`s internal knowledge structure and increases the risk of misinterpreting the context. To this end, this paper proposes FaithfulRAG, a novel framework that resolves knowledge conflicts by explicitly modeling discrepancies between the model`s parametric knowledge and retrieved context. Specifically, FaithfulRAG identifies conflicting knowledge at the fact level and designs a self-thinking process, allowing LLMs to reason about and integrate conflicting facts before generating responses. Extensive experiments demonstrate that our method outperforms state-of-the-art methods. The code is available at https://github.com/DeepLearnXMU/Faithful-RAG

Qinggang Zhang, Zhishang Xiang, Yilin Xiao, Le Wang, Junhui Li, Xinrun Wang, Jinsong Su• 2025

Related benchmarks

TaskDatasetResultRank
Question AnsweringMuSiQue
Accuracy (ACC)79.9
36
Question AnsweringFaithEval
Accuracy81.7
27
Question AnsweringSQuAD
Accuracy (ACC)86.3
27
Question AnsweringRealtimeQA
Accuracy84.1
27
Question AnsweringMuSiQue entity-level knowledge conflict (test)
Mean Rank7.7
24
Question AnsweringSQuAD entity-level knowledge conflict (test)
MR9.7
24
Question AnsweringMuSiQue
LLM Accuracy52.9
20
Question AnsweringHotpotQA
LLM Accuracy76.9
20
Long-form Question AnsweringGraphRAG-Bench Med
LLM Accuracy75.4
20
Long-form Question AnsweringNovel GraphRAG-Bench
LLM-Acc60.7
20
Showing 10 of 18 rows

Other info

Code

Follow for update