Enhancing Uncertainty-Based Hallucination Detection with Stronger Focus

About

Large Language Models (LLMs) have gained significant popularity for their impressive performance across diverse fields. However, LLMs are prone to hallucinate untruthful or nonsensical outputs that fail to meet user expectations in many real-world applications. Existing works for detecting hallucinations in LLMs either rely on external knowledge for reference retrieval or require sampling multiple responses from the LLM for consistency verification, making these methods costly and inefficient. In this paper, we propose a novel reference-free, uncertainty-based method for detecting hallucinations in LLMs. Our approach imitates human focus in factuality checking from three aspects: 1) focus on the most informative and important keywords in the given text; 2) focus on the unreliable tokens in historical context which may lead to a cascade of hallucinations; and 3) focus on the token properties such as token type and token frequency. Experimental results on relevant datasets demonstrate the effectiveness of our proposed method, which achieves state-of-the-art performance across all the evaluation metrics and eliminates the need for additional information.

Tianhang Zhang, Lin Qiu, Qipeng Guo, Cheng Deng, Yue Zhang, Zheng Zhang, Chenghu Zhou, Xinbing Wang, Luoyi Fu• 2023

Related benchmarks

Task	Dataset	Result
Hallucination Detection	TriviaQA	AUROC0.589	621
Hallucination Detection	TriviaQA (test)	AUC-ROC81.7	243
Hallucination Detection	HaluEval (test)	AUC-ROC78.1	176
Hallucination Detection	TruthfulQA (test)	AUC-ROC81.4	112
Uncertainty Estimation	TriviaQA	AUROC66	111
Hallucination Detection	CoQA	Mean AUROC0.53	107
Hallucination Detection	NQ (test)	AUC ROC79.4	91
Uncertainty Quantification	Aggregated Experimental Datasets (XSum, SamSum, CNN, WMT19, MedQUAD, TruthfulQA, CoQA, SciQ, TriviaQA, MMLU, GSM8k) (test)	Mean Rank7	88
Hallucination Detection	HELM Sentence Level v1.0 (test)	AUC0.7593	84
Hallucination Detection	HELM Passage Level v1.0 (test)	AUC0.8659	84

Showing 10 of 67 rows

Other info

Follow for update

@wizwand_team Discord