Aligning with Your Own Voice: Self-Corrected Preference Learning for Hallucination Mitigation in LVLMs

About

Large Vision-Language Models (LVLMs) frequently suffer from hallucinations. Existing preference learning-based approaches largely rely on proprietary models to construct preference datasets. We identify that this reliance introduces a distributional mismatch between the proprietary and target models that hinders efficient alignment. To address this, we propose Alignment via VErified Self-correction DPO (AVES-DPO), a framework that aligns LVLMs using in-distribution data derived from the model's intrinsic knowledge. Our approach employs a consensus-based verification mechanism to diagnose diverse hallucinations and guides the model to self-correct, thereby generating preference pairs strictly compatible with its internal distribution. Extensive experiments demonstrate that AVES-DPO surpasses existing baselines in hallucination mitigation while requiring only 5.2k samples.

Byeonggeuk Lim, JungMin Yun, Junehyoung Kwon, Kyeonghyun Kim, YoungBin Kim• 2026

Related benchmarks

Task	Dataset	Result
Hallucination Evaluation	POPE	--	281
Hallucination Evaluation	AMBER	CHAIR3.9	267
Multimodal Hallucination Evaluation	MMHal-Bench	Average Score2.36	140
Hallucination Evaluation	Object-HalBench	CHAIR Score (s)12.2	78
Hallucination Evaluation	MMHal-Bench-V	Hallucination Score2.41	9
General Multimodal Perception and Recognition	MME Perception	Existence Score195	7

Showing 6 of 6 rows

Other info

Follow for update

@wizwand_team Discord