Bayesian Teaching Enables Probabilistic Reasoning in Large Language Models

About

Large language models (LLMs) are increasingly used as agents that interact with users and with the world. To do so successfully, LLMs must construct representations of the world and form probabilistic beliefs about them. To provide personalized recommendations, for example, the LLM needs to infer a user's preferences from their behavior over multiple interactions. The Bayesian inference framework lays out the optimal way for an agent to update its beliefs as it receives new information. We first show that LLMs fall far short of the standard defined by the Bayesian framework. We then show that by teaching LLMs to mimic the predictions of the normative Bayesian model, we can dramatically improve their ability to update their beliefs; this ability generalizes to new tasks. We conclude that LLMs can effectively learn reasoning skills from examples and generalize those skills to new domains.

Linlu Qiu, Fei Sha, Kelsey Allen, Yoon Kim, Tal Linzen, Sjoerd van Steenkiste• 2025

Related benchmarks

Task	Dataset	Result
Flight Recommendation	Flight Recommendation 1st Round	Accuracy55.5	18
Flight Recommendation	Flight Recommendation Final 5th Round	Accuracy74.3	18
Flight Recommendation	Flight recommendation	Inference Time per Round (s)1.05	15

Showing 3 of 3 rows

Other info

Follow for update

@wizwand_team Discord