Adaptive Theory of Mind for LLM-based Multi-Agent Coordination

About

Theory of Mind (ToM) refers to the ability to reason about others' mental states, and higher-order ToM involves considering that others also possess their own ToM. Equipping large language model (LLM)-driven agents with ToM has long been considered to improve their coordination in multiagent collaborative tasks. However, we find that misaligned ToM orders-mismatches in the depth of ToM reasoning between agents-can lead to insufficient or excessive reasoning about others, thereby impairing their coordination. To address this issue, we design an adaptive ToM (A-ToM) agent, which can align in ToM orders with its partner. Based on prior interactions, the agent estimates the partner's likely ToM order and leverages this estimation to predict the partner's action, thereby facilitating behavioral coordination. We conduct empirical evaluations on four multi-agent coordination tasks: a repeated matrix game, two grid navigation tasks and an Overcooked task. The results validate our findings on ToM alignment and demonstrate the effectiveness of our A-ToM agent. Furthermore, we discuss the generalizability of our A-ToM to non-LLM-based agents, as well as what would diminish the importance of ToM alignment.

Chunjiang Mu, Ya Zeng, Qiaosheng Zhang, Kun Shao, Chen Chu, Hao Guo, Danyang Jia, Zhen Wang, Shuyue Hu• 2026

Related benchmarks

Task	Dataset	Result
Coordination	Memory-1	Point Score75	14
Coordination	Memory-N	Point Score75	14
Coordination	Game 1	Time5.87	14
Coordination	Game 2	Time7	14
Coordination	Overcooked	Time43.53	14
Multi-agent coordination	Overcooked cross-play official AI library	Time48.05	7

Showing 6 of 6 rows

Other info

Follow for update

@wizwand_team Discord