Mitigating Hallucinations in Large Vision-Language Models via Causal Route Gating

About

Large vision-language models (LVLMs) often hallucinate content that is fluent yet unsupported by the image, limiting their reliability in real-world deployment. We show that a key failure mode arises from route competition: even when visual tokens receive attention, the final token decision can be dominated by the textual pathway, causing the decoder to follow linguistic priors over visual evidence. To mitigate this, we propose a training-free, decision-aligned intervention that decomposes each attention head into a visual route and a text route, and estimates their token-level effects using an efficient one-forward/one-gradient approximation. These estimates reveal route conflict within heads and identify prior-dominant ones, enabling selective suppression of only the text route while keeping the visual route intact. Across five benchmarks spanning discriminative and generative settings, our method consistently reduces hallucination-related errors across models with limited impact on overall multimodal performance, while incurring a modest inference-time overhead.

Zhe Cheng, Wenyu Chen, Fode Zhang, Dehuan Shen• 2026

Related benchmarks

Task	Dataset	Result
Hallucination Evaluation	MMHal-Bench	MMHal Score2.8	309
Hallucination Evaluation	AMBER	CHAIR4.6	267
Object Hallucination Evaluation	POPE Adversarial	Accuracy86.98	174
Object Hallucination Evaluation	POPE (Random)	Accuracy91.45	152
Object Hallucination Evaluation	POPE Popular	Accuracy90.73	100
Object Hallucination Evaluation	MSCOCO POPE	Random Accuracy90.3	71
Visual Hallucination Evaluation	POPE (average)	F1 Score88.93	14
Hallucination assessment	CHAIR	CS34.2	12
Object Hallucination Evaluation	AMBER	Hallucination Score (Hal)23.2	11

Showing 9 of 9 rows

Other info

Follow for update

@wizwand_team Discord