Think Twice: Perspective-Taking Improves Large Language Models' Theory-of-Mind Capabilities

About

Human interactions are deeply rooted in the interplay of thoughts, beliefs, and desires made possible by Theory of Mind (ToM): our cognitive ability to understand the mental states of ourselves and others. Although ToM may come naturally to us, emulating it presents a challenge to even the most advanced Large Language Models (LLMs). Recent improvements to LLMs' reasoning capabilities from simple yet effective prompting techniques such as Chain-of-Thought have seen limited applicability to ToM. In this paper, we turn to the prominent cognitive science theory "Simulation Theory" to bridge this gap. We introduce SimToM, a novel two-stage prompting framework inspired by Simulation Theory's notion of perspective-taking. To implement this idea on current ToM benchmarks, SimToM first filters context based on what the character in question knows before answering a question about their mental state. Our approach, which requires no additional training and minimal prompt-tuning, shows substantial improvement over existing methods, and our analysis reveals the importance of perspective-taking to Theory-of-Mind capabilities. Our findings suggest perspective-taking as a promising direction for future research into improving LLMs' ToM capabilities.
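The two stages described above can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the prompt wording, the function name `simtom_answer`, and the `llm` callable (any function that maps a prompt string to a completion string) are all assumptions made for this example.

```python
from typing import Callable

# Hypothetical prompt templates (assumed for illustration; the paper's
# actual prompts are not reproduced here).
PERSPECTIVE_PROMPT = (
    "The following is a sequence of events:\n{story}\n\n"
    "Which events does {character} know about? "
    "List only those events, one per line."
)
QUESTION_PROMPT = (
    "{context}\n\n"
    "You are {character}. Based only on the events above, "
    "answer the question:\n{question}"
)


def simtom_answer(
    llm: Callable[[str], str],
    story: str,
    character: str,
    question: str,
) -> str:
    """Two-stage prompting sketch: filter the context, then answer."""
    # Stage 1 (perspective-taking): ask the model to keep only the
    # events that the character in question knows about.
    filtered_context = llm(
        PERSPECTIVE_PROMPT.format(story=story, character=character)
    )
    # Stage 2 (question answering): answer the question about the
    # character's mental state using only the filtered context.
    return llm(
        QUESTION_PROMPT.format(
            context=filtered_context,
            character=character,
            question=question,
        )
    )
```

The key design point is that the second call never sees the full story, only whatever the first call returned, so events the character did not witness cannot leak into the answer. No training is involved; both stages are plain prompts to the same model.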

Alex Wilf, Sihyun Shawn Lee, Paul Pu Liang, Louis-Philippe Morency • 2023

Related benchmarks

Task	Dataset	Result
Theory of Mind	BigToM	Accuracy: 99	64
Theory of Mind	HiToM	Accuracy: 71	64
Theory of Mind	ToMi	Accuracy: 79.9	55
Theory of Mind reasoning	MuMa-ToM	Accuracy: 49.6	45
Theory of Mind reasoning	MMToM-QA	Overall Accuracy: 51	44
Theory of Mind reasoning	BigTOM (All)	Accuracy: 95.5	24
Mental State Inference	MMToM-QA human 1.0 (test)	Sub-score 1.1: 100	20
Theory of Mind reasoning	BigTOM False Belief	Accuracy: 93.25	18
Theory of Mind reasoning	ToMI False Belief	Accuracy: 95.5	18
Theory of Mind reasoning	MMToM-QA Text-only	Belief Inference 1.1: 0.96	17

Showing 10 of 18 rows
