Don't Throw Away Your Beams: Improving Consistency-based Uncertainties in LLMs via Beam Search

About

Consistency-based methods have emerged as an effective approach to uncertainty quantification (UQ) in large language models. These methods typically rely on several generations obtained via multinomial sampling, measuring their agreement level. However, in short-form QA, multinomial sampling is prone to producing duplicates due to peaked distributions, and its stochasticity introduces considerable variance in uncertainty estimates across runs. We introduce a new family of methods that employ beam search to generate candidates for consistency-based UQ, yielding improved performance and reduced variance compared to multinomial sampling. We also provide a theoretical lower bound on the beam set probability mass under which beam search achieves a smaller error than multinomial sampling. We empirically evaluate our approach on six QA datasets and find that its consistent improvements over multinomial sampling lead to state-of-the-art UQ performance.
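The contrast between the two candidate-generation strategies can be illustrated with a toy sketch. This is not the paper's estimator, which the abstract does not specify. It assumes a hypothetical answer-level distribution `dist` in place of a real model, uses a top-k selection as a stand-in for a beam set, and uses two deliberately simple agreement scores (`sampling_uncertainty` and `beam_uncertainty`, both hypothetical names). It only shows why a peaked distribution yields duplicate samples and run-to-run variance, while a deterministic candidate set does not.

```python
import random
from collections import Counter

# Hypothetical, peaked answer distribution for one short-form question.
dist = {"Paris": 0.85, "Lyon": 0.08, "Marseille": 0.04, "Nice": 0.03}


def sample_candidates(dist, n, rng):
    """Draw n answers by multinomial sampling from the toy distribution."""
    answers = list(dist)
    weights = [dist[a] for a in answers]
    return rng.choices(answers, weights=weights, k=n)


def top_k_candidates(dist, k):
    """Stand-in for a beam set: the k most probable distinct answers."""
    return sorted(dist, key=dist.get, reverse=True)[:k]


def sampling_uncertainty(samples):
    """1 minus the fraction of samples that match the most frequent answer."""
    most_common_count = Counter(samples).most_common(1)[0][1]
    return 1.0 - most_common_count / len(samples)


def beam_uncertainty(dist, beam):
    """1 minus the top candidate's share of the beam set's probability mass."""
    mass = sum(dist[a] for a in beam)
    return 1.0 - dist[beam[0]] / mass


# Sampling: the estimate depends on the seed, and samples repeat.
sampled_estimates = []
for seed in range(5):
    samples = sample_candidates(dist, n=5, rng=random.Random(seed))
    sampled_estimates.append(sampling_uncertainty(samples))

# Top-k stand-in for beam search: distinct candidates, one deterministic estimate.
beam = top_k_candidates(dist, k=3)
beam_estimate = beam_uncertainty(dist, beam)

print("sampling estimates across seeds:", sampled_estimates)
print("beam candidates:", beam, "estimate:", round(beam_estimate, 4))
```

In this sketch the beam-style estimate is identical on every run, because the candidate set is deterministic, whereas the sampling-based estimate can change with the random seed.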

Ekaterina Fadeeva, Maiya Goloburda, Aleksandr Rubashevskii, Roman Vashurin, Artem Shelmanov, Preslav Nakov, Mrinmaya Sachan, Maxim Panov • 2025

Related benchmarks

Task                        Dataset                Result        Rank
Uncertainty Quantification  Average of 6 datasets  PRR 65        120
Question Answering          TriviaQA               PR-AUC 0.766  16
Question Answering          Web Questions          PR-AUC 72.2   16
Question Answering          CoQA                   PR-AUC 60     16
Question Answering          HotpotQA               PR-AUC 0.681  16
Question Answering          CommonsenseQA          PR-AUC 0.595  16
Question Answering          ARC Challenge          PR-AUC 0.636  16