Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

FaithfulRAG: Fact-Level Conflict Modeling for Context-Faithful Retrieval-Augmented Generation

About

Large language models (LLMs) augmented with retrieval systems have demonstrated significant potential in handling knowledge-intensive tasks. However, these models often struggle with unfaithfulness issues, generating outputs that either ignore the retrieved context or inconsistently blend it with the LLM`s parametric knowledge. This issue is particularly severe in cases of knowledge conflict, where the retrieved context conflicts with the model`s parametric knowledge. While existing faithful RAG approaches enforce strict context adherence through well-designed prompts or modified decoding strategies, our analysis reveals a critical limitation: they achieve faithfulness by forcibly suppressing the model`s parametric knowledge, which undermines the model`s internal knowledge structure and increases the risk of misinterpreting the context. To this end, this paper proposes FaithfulRAG, a novel framework that resolves knowledge conflicts by explicitly modeling discrepancies between the model`s parametric knowledge and retrieved context. Specifically, FaithfulRAG identifies conflicting knowledge at the fact level and designs a self-thinking process, allowing LLMs to reason about and integrate conflicting facts before generating responses. Extensive experiments demonstrate that our method outperforms state-of-the-art methods. The code is available at https://github.com/DeepLearnXMU/Faithful-RAG

Qinggang Zhang, Zhishang Xiang, Yilin Xiao, Le Wang, Junhui Li, Xinrun Wang, Jinsong Su• 2025

Related benchmarks

TaskDatasetResultRank
Faithfulness EvaluationFaithEval
F1 Score70.2
42
Multiple-choice Question AnsweringConFiQA MC
F1 Score72.3
42
Question AnsweringMuSiQue
Accuracy (ACC)79.9
36
Question AnsweringSQuAD KRE-curated version
F1 Score66.2
36
Open-ended Question AnsweringConFiQA (test)
F1 Score75.3
36
Multi-step Reasoning Question AnsweringConFiQA MR (test)
F1 Score62.4
36
Question AnsweringMuSiQue
LLM Accuracy52.9
34
Question AnsweringFaithEval
Accuracy81.7
27
Question AnsweringSQuAD
Accuracy (ACC)86.3
27
Question AnsweringRealtimeQA
Accuracy84.1
27
Showing 10 of 23 rows

Other info

Code

Follow for update