Aligning with Your Own Voice: Self-Corrected Preference Learning for Hallucination Mitigation in LVLMs
About
Large Vision-Language Models (LVLMs) frequently suffer from hallucinations. Existing preference learning-based approaches largely rely on proprietary models to construct preference datasets. We identify that this reliance introduces a distributional mismatch between the proprietary and target models that hinders efficient alignment. To address this, we propose Alignment via VErified Self-correction DPO (AVES-DPO), a framework that aligns LVLMs using in-distribution data derived from the model's intrinsic knowledge. Our approach employs a consensus-based verification mechanism to diagnose diverse hallucinations and guides the model to self-correct, thereby generating preference pairs strictly compatible with its internal distribution. Extensive experiments demonstrate that AVES-DPO surpasses existing baselines in hallucination mitigation while requiring only 5.2k samples.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Hallucination Evaluation | AMBER | CHAIR3.9 | 222 | |
| Hallucination Evaluation | POPE | -- | 217 | |
| Multimodal Hallucination Evaluation | MMHal-Bench | Average Score2.36 | 129 | |
| Hallucination Evaluation | Object-HalBench | CHAIR Score (s)12.2 | 78 | |
| Hallucination Evaluation | MMHal-Bench-V | Hallucination Score2.41 | 9 | |
| General Multimodal Perception and Recognition | MME Perception | Existence Score195 | 7 |