VASparse: Towards Efficient Visual Hallucination Mitigation via Visual-Aware Token Sparsification

About

Large Vision-Language Models (LVLMs) may produce outputs that are unfaithful to reality, also known as visual hallucinations (VH), which significantly impedes their real-world usage. To alleviate VH, various decoding strategies have been proposed to enhance visual information. However, many of these methods may require secondary decoding and rollback, which significantly reduces inference speed. In this work, we propose an efficient plug-and-play decoding algorithm via Visual-Aware Sparsification (VASparse) from the perspective of token sparsity for mitigating VH. VASparse is inspired by empirical observations: (1) the sparse activation of attention in LVLMs, and (2) visual-agnostic tokens sparsification exacerbates VH. Based on these insights, we propose a novel token sparsification strategy that balances efficiency and trustworthiness. Specifically, VASparse implements a visual-aware token selection strategy during decoding to reduce redundant tokens while preserving visual context effectively. Additionally, we innovatively introduce a sparse-based visual contrastive decoding method to recalibrate the distribution of hallucinated outputs without the time overhead associated with secondary decoding. Subsequently, VASparse recalibrates attention scores to penalize attention sinking of LVLMs towards text tokens. Extensive experiments across four popular benchmarks confirm the effectiveness of VASparse in mitigating VH across different LVLM families without requiring additional training or post-processing. Impressively, VASparse achieves state-of-the-art performance for mitigating VH while maintaining competitive decoding speed. Code is available at https://github.com/mengchuang123/VASparse-github.

Xianwei Zhuang, Zhihong Zhu, Yuxin Xie, Liming Liang, Yuexian Zou• 2025

Related benchmarks

Task	Dataset	Result
Hallucination Evaluation	AMBER	--	222
Hallucination Evaluation	POPE	--	217
Visual Hallucination Evaluation	MSCOCO	CHAIR_i11.35	104
Object Hallucination Evaluation	POPE Popular offline	F1 Score82.44	84
Object Hallucination Evaluation	POPE Adversarial offline	F1 Score68.88	84
Object Hallucination Evaluation	POPE Random offline	F1 Score72.33	84
Hallucination Evaluation	MSCOCO (val)	CHAIR_i17.02	36
General Task	MMStar	Accuracy60.1	36
Hallucination	HallusionBench	HallusionBench Score65.1	26
Hallucination Evaluation	MME hallucination (test)	Existence Score180	24

Showing 10 of 19 rows

Other info

Code

Follow for update

@wizwand_team Discord