HICD: Hallucination-Inducing via Attention Dispersion for Contrastive Decoding to Mitigate Hallucinations in Large Language Models

About

Large Language Models (LLMs) often generate hallucinations, producing outputs that are contextually inaccurate or factually incorrect. We introduce HICD, a novel method designed to induce hallucinations for contrastive decoding to mitigate hallucinations. Unlike existing contrastive decoding methods, HICD selects attention heads crucial to the model's prediction as inducing heads, then induces hallucinations by dispersing attention of these inducing heads and compares the hallucinated outputs with the original outputs to obtain the final result. Our approach significantly improves performance on tasks requiring contextual faithfulness, such as context completion, reading comprehension, and question answering. It also improves factuality in tasks requiring accurate knowledge recall. We demonstrate that our inducing heads selection and attention dispersion method leads to more "contrast-effective" hallucinations for contrastive decoding, outperforming other hallucination-inducing methods. Our findings provide a promising strategy for reducing hallucinations by inducing hallucinations in a controlled manner, enhancing the performance of LLMs in a wide range of tasks.

Xinyan Jiang, Hang Ye, Yongxin Zhu, Xiaoying Zheng, Zikang Chen, Jun Gong• 2025

Related benchmarks

Task	Dataset	Result
Commonsense Reasoning	HellaSwag	Accuracy87.1	1896
Question Answering	OpenBookQA	Accuracy63.4	465
Sentence Completion	HellaSwag	Accuracy84.33	364
Reading Comprehension	RACE high	Accuracy46.68	295
Reading Comprehension	RACE mid	Accuracy59.96	196
Factuality Evaluation	TruthfulQA	MC265.63	103
Multiple-Choice	TruthfulQA	MC1 Accuracy49.75	83
Reading Comprehension	RACE	RACE Middle Score68.9	21
Factuality Evaluation	FACTOR	--	18
Factual Consistency	FACTOR	Factual Consistency (Wiki)64.43	14

Showing 10 of 13 rows

Other info

Code

Follow for update

@wizwand_team Discord