Alleviating Hallucinations of Large Language Models through Induced Hallucinations

About

Despite their impressive capabilities, large language models (LLMs) have been observed to generate responses that include inaccurate or fabricated information, a phenomenon commonly known as ``hallucination''. In this work, we propose a simple \textit{Induce-then-Contrast} Decoding (ICD) strategy to alleviate hallucinations. We first construct a factually weak LLM by inducing hallucinations from the original LLMs. Then, we penalize these induced hallucinations during decoding to enhance the factuality of the generated content. Concretely, we determine the final next-token predictions by amplifying the predictions from the original model and downplaying the induced untruthful predictions via contrastive decoding. Experimental results on both discrimination-based and generation-based hallucination evaluation benchmarks, such as TruthfulQA and \textsc{FActScore}, demonstrate that our proposed ICD methods can effectively enhance the factuality of LLMs across various model sizes and families. For example, when equipped with ICD, Llama2-7B-Chat and Mistral-7B-Instruct achieve performance comparable to ChatGPT and GPT4 on TruthfulQA, respectively.

Yue Zhang, Leyang Cui, Wei Bi, Shuming Shi• 2023

Related benchmarks

Task	Dataset	Result
Object Hallucination Evaluation	POPE	--	2019
Visual Question Answering	VizWiz	Accuracy46.9	1820
Question Answering	SQuAD 2.0	--	215
Multiple-Choice	TruthfulQA	MC1 Accuracy46.32	83
Multimodal Conversation	LLaVA-Bench Wild	Score69.7	78
Question Answering	TruthfulQA	--	73
Question Answering	TruthfulQA MC1	MC1 Accuracy46.32	54
Visual Question Answering	ScienceQA (SQA)	SQA Accuracy62.8	43
Visual Question Answering	MM-Vet	MM-Vet ASR Accuracy30.4	33
Truthfulness Evaluation	TruthfulQA (test)	MC137.87	30

Showing 10 of 16 rows

Other info

Follow for update

@wizwand_team Discord