Semantic Density: Uncertainty Quantification for Large Language Models through Confidence Measurement in Semantic Space

About

With the widespread application of Large Language Models (LLMs) to various domains, concerns regarding the trustworthiness of LLMs in safety-critical scenarios have been raised, due to their unpredictable tendency to hallucinate and generate misinformation. Existing LLMs do not have an inherent functionality to provide the users with an uncertainty/confidence metric for each response it generates, making it difficult to evaluate trustworthiness. Although several studies aim to develop uncertainty quantification methods for LLMs, they have fundamental limitations, such as being restricted to classification tasks, requiring additional training and data, considering only lexical instead of semantic information, and being prompt-wise but not response-wise. A new framework is proposed in this paper to address these issues. Semantic density extracts uncertainty/confidence information for each response from a probability distribution perspective in semantic space. It has no restriction on task types and is "off-the-shelf" for new models and tasks. Experiments on seven state-of-the-art LLMs, including the latest Llama 3 and Mixtral-8x22B models, on four free-form question-answering benchmarks demonstrate the superior performance and robustness of semantic density compared to prior approaches.

Xin Qiu, Risto Miikkulainen• 2024

Related benchmarks

Task	Dataset	Result
Correctness Prediction	TriviaQA	AUROC0.8244	113
Question Answering	QA	Mean PRR38.5	109
Summarization	Summ.	Mean PRR0.217	109
Machine Translation	MT	Mean PRR29.1	109
Hallucination Detection	CoQA	Mean AUROC0.61	107
Uncertainty Quantification	Aggregated Experimental Datasets (XSum, SamSum, CNN, WMT19, MedQUAD, TruthfulQA, CoQA, SciQ, TriviaQA, MMLU, GSM8k) (test)	Mean Rank11.17	88
Selective Generation	CoQA	ROC-AUC73.4	66
Selective Generation	TriviaQA	ROC-AUC85.8	66
Question Answering	SciQ	PRR51.4	66
Question Answering	MedQUAD	PRR9.5	66

Showing 10 of 57 rows

Other info

Follow for update

@wizwand_team Discord