Confidence Calibration and Rationalization for LLMs via Multi-Agent Deliberation

About

Uncertainty estimation is a significant issue for current large language models (LLMs) that are generally poorly calibrated and over-confident, especially with reinforcement learning from human feedback (RLHF). Unlike humans, whose decisions and confidences not only stem from intrinsic beliefs but can also be adjusted through daily observations, existing calibration methods for LLMs focus on estimating or eliciting individual confidence without taking full advantage of the "Collective Wisdom": the interaction among multiple LLMs that can collectively improve both accuracy and calibration. In this work, we propose Collaborative Calibration, a post-hoc training-free calibration strategy that leverages the collaborative and expressive capabilities of multiple tool-augmented LLM agents in a simulated group deliberation process. We demonstrate the effectiveness of Collaborative Calibration on generative QA tasks across various domains, showing its potential in harnessing the rationalization of collectively calibrated confidence assessments and improving the reliability of model predictions.

Ruixin Yang, Dheeraj Rajagopal, Shirley Anugrah Hayati, Bin Hu, Dongyeop Kang• 2024

Related benchmarks

Task	Dataset	Result
Question Answering	TriviaQA	BS (%)10.32	65
Calibration	TriviaQA	--	39
Calibration	TruthfulQA	--	32
Calibration	GSM8K	ECE8.01	11
Calibration	BBH	ECE11.59	11
Math Reasoning	GSM8K	Brier Score8.41	11
Reasoning	BBH	Brier Score (BBH)18.26	11
Calibration	MMLU-Pro	ECE31.3	11
Calibration	Mean macro-average across benchmarks	Expected Calibration Error (ECE)22.52	11
Language Understanding	MMLU-Pro	Brier Score33.71	11

Showing 10 of 11 rows

Other info

Follow for update

@wizwand_team Discord