Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

CoCoA: Confidence and Context-Aware Adaptive Decoding for Resolving Knowledge Conflicts in Large Language Models

About

Faithful generation in large language models (LLMs) is challenged by knowledge conflicts between parametric memory and external context. Existing contrastive decoding methods tuned specifically to handle conflict often lack adaptability and can degrade performance in low conflict settings. We introduce CoCoA (Confidence- and Context-Aware Adaptive Decoding), a novel token-level algorithm for principled conflict resolution and enhanced faithfulness. CoCoA resolves conflict by utilizing confidence-aware measures (entropy gap and contextual peakedness) and the generalized divergence between the parametric and contextual distributions. Crucially, CoCoA maintains strong performance even in low conflict settings. Extensive experiments across multiple LLMs on diverse Question Answering (QA), Summarization, and Long-Form Question Answering (LFQA) benchmarks demonstrate CoCoA's state-of-the-art performance over strong baselines like AdaCAD. It yields significant gains in QA accuracy, up to 9.2 points on average compared to the strong baseline AdaCAD, and improves factuality in summarization and LFQA by up to 2.5 points on average across key benchmarks. Additionally, it demonstrates superior sensitivity to conflict variations. CoCoA enables more informed, context-aware, and ultimately more faithful token generation.

Anant Khandelwal, Manish Gupta, Puneet Agrawal• 2025

Related benchmarks

TaskDatasetResultRank
Question AnsweringPopQA
Accuracy84.12
103
Table Question AnsweringTabMWP
Accuracy57.62
97
Question AnsweringNatural Questions (NQ) (test)
Exact Match45.1
77
Question AnsweringNQ-Open (val)
Accuracy43.8
46
Question AnsweringNQ-Swap
Accuracy70.88
38
Factuality EvaluationFactScore
Pairwise Score79.5
24
Question AnsweringHELPFUL
Accuracy87.54
18
Question AnsweringTabMWP
Accuracy53.2
18
Dialogue SummarizationTofuEval
ToFuEval Score81.04
18
Question AnsweringRestate hard
Accuracy89.6
18
Showing 10 of 19 rows

Other info

Follow for update