Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Nullu: Mitigating Object Hallucinations in Large Vision-Language Models via HalluSpace Projection

About

Recent studies have shown that large vision-language models (LVLMs) often suffer from the issue of object hallucinations (OH). To mitigate this issue, we introduce an efficient method that edits the model weights based on an unsafe subspace, which we call HalluSpace in this paper. With truthful and hallucinated text prompts accompanying the visual content as inputs, the HalluSpace can be identified by extracting the hallucinated embedding features and removing the truthful representations in LVLMs. By orthogonalizing the model weights, input features will be projected into the Null space of the HalluSpace to reduce OH, based on which we name our method Nullu. We reveal that HalluSpaces generally contain prior information in the large language models (LLMs) applied to build LVLMs, which have been shown as essential causes of OH in previous studies. Therefore, null space projection suppresses the LLMs' priors to filter out the hallucinated features, resulting in contextually accurate outputs. Experiments show that our method can effectively mitigate OH across different LVLM families without extra inference costs and also show strong performance in general LVLM benchmarks. Code is released at https://github.com/Ziwei-Zheng/Nullu.

Le Yang, Ziwei Zheng, Boxu Chen, Zhengyu Zhao, Chenhao Lin, Chao Shen• 2024

Related benchmarks

TaskDatasetResultRank
Object Hallucination EvaluationPOPE
Accuracy79.11
1455
Object HallucinationPOPE Adversarial
Accuracy79.4
288
Object HallucinationPOPE (Random)
F1 Score89.2
285
Object HallucinationPOPE Popular
F1 Score85.63
273
Hallucination EvaluationCHAIR
CHAIR_s50.2
252
Hallucination EvaluationMMHal-Bench
MMHal Score3.53
216
Visual Question AnsweringA-OKVQA
Acc86
202
Object Hallucination EvaluationMS-COCO (POPE Adversarial)
Accuracy84.11
138
Object Hallucination EvaluationMS-COCO POPE (Popular)
Accuracy86.06
108
Object Hallucination EvaluationPOPE A-OKVQA
Accuracy84.93
75
Showing 10 of 45 rows

Other info

Code

Follow for update