Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Bayesian Teaching Enables Probabilistic Reasoning in Large Language Models

About

Large language models (LLMs) are increasingly used as agents that interact with users and with the world. To do so successfully, LLMs must construct representations of the world and form probabilistic beliefs about them. To provide personalized recommendations, for example, the LLM needs to infer a user's preferences from their behavior over multiple interactions. The Bayesian inference framework lays out the optimal way for an agent to update its beliefs as it receives new information. We first show that LLMs fall far short of the standard defined by the Bayesian framework. We then show that by teaching LLMs to mimic the predictions of the normative Bayesian model, we can dramatically improve their ability to update their beliefs; this ability generalizes to new tasks. We conclude that LLMs can effectively learn reasoning skills from examples and generalize those skills to new domains.

Linlu Qiu, Fei Sha, Kelsey Allen, Yoon Kim, Tal Linzen, Sjoerd van Steenkiste• 2025

Related benchmarks

TaskDatasetResultRank
Flight RecommendationFlight Recommendation 1st Round
Accuracy55.5
18
Flight RecommendationFlight Recommendation Final 5th Round
Accuracy74.3
18
Flight RecommendationFlight recommendation
Inference Time per Round (s)1.05
15
Showing 3 of 3 rows

Other info

Follow for update