Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Hypothesis-Driven Theory-of-Mind Reasoning for Large Language Models

About

Existing LLM reasoning methods have shown impressive capabilities across various tasks, such as solving math and coding problems. However, applying these methods to scenarios without ground-truth answers or rule-based verification methods - such as tracking the mental states of an agent - remains challenging. Inspired by the sequential Monte Carlo algorithm, we introduce thought-tracing, an inference-time reasoning algorithm designed to trace the mental states of specific agents by generating hypotheses and weighting them based on observations without relying on ground-truth solutions to questions in datasets. Our algorithm is modeled after the Bayesian theory-of-mind framework, using LLMs to approximate probabilistic inference over agents' evolving mental states based on their perceptions and actions. We evaluate thought-tracing on diverse theory-of-mind benchmarks, demonstrating significant performance improvements compared to baseline LLMs. Our experiments also reveal interesting behaviors of the recent reasoning models - e.g., o3 and R1 - on theory-of-mind, highlighting the difference of social reasoning compared to other domains.

Hyunwoo Kim, Melanie Sclar, Tan Zhi-Xuan, Lance Ying, Sydney Levine, Yang Liu, Joshua B. Tenenbaum, Yejin Choi• 2025

Related benchmarks

TaskDatasetResultRank
Question AnsweringHousehold (full)
Accuracy72.3
25
Question AnsweringGridWorld (full)
Accuracy64
22
Theory of Mind Question AnsweringMMToM-QA
Accuracy69
6
Theory of Mind Question AnsweringFanToM
Accuracy87.4
4
Showing 4 of 4 rows

Other info

Follow for update