Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Language Models with Conformal Factuality Guarantees

About

Guaranteeing the correctness and factuality of language model (LM) outputs is a major open problem. In this work, we propose conformal factuality, a framework that can ensure high probability correctness guarantees for LMs by connecting language modeling and conformal prediction. We observe that the correctness of an LM output is equivalent to an uncertainty quantification problem, where the uncertainty sets are defined as the entailment set of an LM's output. Using this connection, we show that conformal prediction in language models corresponds to a back-off algorithm that provides high probability correctness guarantees by progressively making LM outputs less specific (and expanding the associated uncertainty sets). This approach applies to any black-box LM and requires very few human-annotated samples. Evaluations of our approach on closed book QA (FActScore, NaturalQuestions) and reasoning tasks (MATH) show that our approach can provide 80-90% correctness guarantees while retaining the majority of the LM's original output.

Christopher Mohri, Tatsunori Hashimoto• 2024

Related benchmarks

TaskDatasetResultRank
Reinforcement Learning from Verifiable RewardsHEAD-QA
AR48.6
30
Distribution Shift RobustnessSixteen Adversarial Cells MedQA + GSM8K (eval)
Violations7
10
Expert-Iteration RLVRMedQA, HEAD-QA, ARC-C, and CaseHOLD
Pathwise Clean Score3
10
Natural Language InferencemedNLI
AR (%)76.8
10
Mathematical ReasoningGSM8K
AR (%)9
10
Question AnsweringMedQA
AR (%)28.4
9
Question AnsweringCaseHold
AR (%)22.6
9
Question AnsweringPubMedQA
AR (%)34.8
8
Natural Language InferenceMedNLI (eval)
Risk13.5
4
Question AnsweringTAT-QA (eval)
Risk8.2
4
Showing 10 of 30 rows

Other info

Follow for update