Antidote: A Unified Framework for Mitigating LVLM Hallucinations in Counterfactual Presupposition and Object Perception

About

Large Vision-Language Models (LVLMs) have achieved impressive results across various cross-modal tasks. However, hallucinations, i.e., the models generating counterfactual responses, remain a challenge. Though recent studies have attempted to alleviate object perception hallucinations, they focus on the models' response generation, and overlooking the task question itself. This paper discusses the vulnerability of LVLMs in solving counterfactual presupposition questions (CPQs), where the models are prone to accept the presuppositions of counterfactual objects and produce severe hallucinatory responses. To this end, we introduce "Antidote", a unified, synthetic data-driven post-training framework for mitigating both types of hallucination above. It leverages synthetic data to incorporate factual priors into questions to achieve self-correction, and decouple the mitigation process into a preference optimization problem. Furthermore, we construct "CP-Bench", a novel benchmark to evaluate LVLMs' ability to correctly handle CPQs and produce factual responses. Applied to the LLaVA series, Antidote can simultaneously enhance performance on CP-Bench by over 50%, POPE by 1.8-3.3%, and CHAIR & SHR by 30-50%, all without relying on external supervision from stronger LVLMs or human feedback and introducing noticeable catastrophic forgetting issues.

Yuanchen Wu, Lu Zhang, Hang Yao, Junlong Du, Ke Yan, Shouhong Ding, Yunsheng Wu, Xiaoqiang Li• 2025

Related benchmarks

Task	Dataset	Result
Hallucination Evaluation	POPE	Accuracy88.06	281
Multimodal Evaluation	MM-Vet	--	249
Object Hallucination Evaluation	CHAIR	CHAIRi Score5.13	174
Hallucination Evaluation	HallusionBench	--	153
Question Answering	ScienceQA	Accuracy64.55	106
Multimodal Perception Assessment	MME Perception	MME-P1.42e+3	77
Counter-Perception Discrimination	CP-Bench (test)	F1 Score83.5	25
Counter-Perception Discrimination	CP-Bench (dev)	F1 Score88	25
Object Existence Hallucination Evaluation	POPE average across three subsets	Accuracy88.93	16
Image Description Hallucination Evaluation	MSCOCO (test)	CHAIR s12.6	11

Showing 10 of 14 rows

Other info

Follow for update

@wizwand_team Discord