Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Aligning LLMs with Human Uncertainty: A Beta-Bernoulli Calibrator for LLM Forecasting

About

Probabilistic forecasting estimates the likelihood of uncertain future events. To improve LLM forecasting, existing methods typically learn from binary outcomes to output verbalized forecasts. However, while aggregated human forecasts contain rich information in both the crowd probability estimate and the degree of agreement among forecasters, how to utilize these signals remains underexplored. To address this, we propose the Beta-Bernoulli Calibrator (BBC), which converts an initial point estimate forecast from any model into a distribution over event likelihood, using supervision from both binary outcomes and human forecasts. BBC models event likelihood $p \sim \text{Beta}(\alpha, \beta)$ and outcome $y \sim \text{Bernoulli}(p)$, with the mean as the calibrated point forecast and the variance as the epistemic uncertainty. Our results show that BBC generally provides better calibrated and more accurate forecasts than both traditional post-hoc calibration methods and models fine-tuned specifically for forecasting, while remaining lightweight and having good generalization. We also show that the epistemic uncertainty captured by BBC is a more reliable predictor of forecasting error than verbalized confidence.

Hui Dai, Ryan Teehan, Parsa Torabian, Mengye Ren• 2026

Related benchmarks

TaskDatasetResultRank
Probabilistic ForecastingMetaculus and Polymarket (test)
Brier Score0.125
30
ForecastingForecasting (test)
Brier Score0.133
21
ForecastingKalshi August 2025 resolution filter (OOD)
Brier Score0.228
10
ForecastingMain Forecasting Dataset
Brier Score0.132
8
Showing 4 of 4 rows

Other info

Follow for update