Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Aligning with Your Own Voice: Self-Corrected Preference Learning for Hallucination Mitigation in LVLMs

About

Large Vision-Language Models (LVLMs) frequently suffer from hallucinations. Existing preference learning-based approaches largely rely on proprietary models to construct preference datasets. We identify that this reliance introduces a distributional mismatch between the proprietary and target models that hinders efficient alignment. To address this, we propose Alignment via VErified Self-correction DPO (AVES-DPO), a framework that aligns LVLMs using in-distribution data derived from the model's intrinsic knowledge. Our approach employs a consensus-based verification mechanism to diagnose diverse hallucinations and guides the model to self-correct, thereby generating preference pairs strictly compatible with its internal distribution. Extensive experiments demonstrate that AVES-DPO surpasses existing baselines in hallucination mitigation while requiring only 5.2k samples.

Byeonggeuk Lim, JungMin Yun, Junehyoung Kwon, Kyeonghyun Kim, YoungBin Kim• 2026

Related benchmarks

TaskDatasetResultRank
Hallucination EvaluationAMBER
CHAIR3.9
222
Hallucination EvaluationPOPE--
217
Multimodal Hallucination EvaluationMMHal-Bench
Average Score2.36
129
Hallucination EvaluationObject-HalBench
CHAIR Score (s)12.2
78
Hallucination EvaluationMMHal-Bench-V
Hallucination Score2.41
9
General Multimodal Perception and RecognitionMME Perception
Existence Score195
7
Showing 6 of 6 rows

Other info

Follow for update