Risk-aware Selective Prompting for Hallucination Mitigation in Large Vision-Language Models

About

Prompt-based verification is widely used to mitigate hallucinations in large vision-language models (LVLMs), yet when it helps remains poorly understood. We systematically study verification prompting across two representative LVLM architectures and hallucination benchmarks, and find that it is a risk-bearing intervention: its corrections increase with input difficulty, while newly introduced errors persist across difficulty levels. As a result, always-on prompting helps on hard inputs but offers little benefit -- and can harm -- easier ones. Our analysis further shows that this behavior is associated with a conservative output shift. Verification prompts redistribute attention from visual tokens toward instruction tokens and induce a distinct middle-layer entropy pattern absent in a neutral-prompt control, suggesting instruction-conditioned attention redistribution rather than uniformly improved visual grounding. Motivated by this input-dependent risk, we propose Risk-aware Selective Prompting (RSP), a training-free approach that uses pre-generation uncertainty signals to trigger verification selectively. RSP mitigates the degradation of always-on prompting while preserving baseline performance, and reveals that effective selection signals vary across architectures.

Yuang Huang, Yafeng Zhang, Yu Zilan• 2026

Related benchmarks

Task	Dataset	Result
Object Hallucination	POPE Adversarial	--	367
Object Hallucination Detection	POPE Popular	F1 Score86.5	27
Object Hallucination Detection	POPE (Random)	F1 Score89.1	6

Showing 3 of 3 rows

Other info

Follow for update

@wizwand_team Discord