Adaptive Theory of Mind for LLM-based Multi-Agent Coordination

About

Theory of Mind (ToM) refers to the ability to reason about others' mental states, and higher-order ToM involves considering that others also possess their own ToM. Equipping large language model (LLM)-driven agents with ToM has long been thought to improve their coordination in multi-agent collaborative tasks. However, we find that misaligned ToM orders (mismatches in the depth of ToM reasoning between agents) can lead to insufficient or excessive reasoning about others, thereby impairing coordination. To address this issue, we design an adaptive ToM (A-ToM) agent that aligns its ToM order with that of its partner. Based on prior interactions, the agent estimates the partner's likely ToM order and uses this estimate to predict the partner's action, thereby facilitating behavioral coordination. We conduct empirical evaluations on four multi-agent coordination tasks: a repeated matrix game, two grid navigation tasks, and an Overcooked task. The results validate our findings on ToM alignment and demonstrate the effectiveness of the A-ToM agent. Furthermore, we discuss the generalizability of A-ToM to non-LLM-based agents, as well as the conditions under which ToM alignment becomes less important.

Chunjiang Mu, Ya Zeng, Qiaosheng Zhang, Kun Shao, Chen Chu, Hao Guo, Danyang Jia, Zhen Wang, Shuyue Hu • 2026
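
The idea in the abstract above (estimate the partner's ToM order from past interactions, then use it to predict the partner's next action) can be illustrated with a small toy. This is a minimal sketch under strong assumptions, not the authors' A-ToM implementation: it uses no LLM, the game is a hypothetical two-action matching game, and the names `tom_action`, `AdaptiveToMAgent`, and `simulate` are invented for illustration. A zero-order agent here simply copies the partner's last action, and an order-k agent simulates its partner as an order-(k-1) agent and matches the predicted action.

```python
def tom_action(order, own_last, partner_last):
    """Action of a hypothetical ToM-`order` agent in a 2-action matching game."""
    if order == 0:
        # Zero-order: assume the partner repeats itself, so copy its last action.
        return partner_last
    # Higher order: simulate the partner as a ToM-(order - 1) agent
    # (roles swapped), then match the action it is predicted to take.
    return tom_action(order - 1, partner_last, own_last)


class AdaptiveToMAgent:
    """Toy adaptive agent: tracks which candidate order best explains the partner."""

    def __init__(self, candidate_orders=(0, 1)):
        self.candidate_orders = candidate_orders
        self.hits = {k: 0 for k in candidate_orders}

    def estimate_partner_order(self):
        # Pick the order with the most correct past predictions;
        # ties are broken in favour of the lowest order.
        return max(self.candidate_orders, key=lambda k: (self.hits[k], -k))

    def act(self, own_last, partner_last):
        k = self.estimate_partner_order()
        # Predict the partner's move from the partner's point of view,
        # then match it (matching is the best response in this game).
        return tom_action(k, partner_last, own_last)

    def update(self, prev_own, prev_partner, partner_action):
        # Credit every candidate order that would have predicted the move.
        for k in self.candidate_orders:
            if tom_action(k, prev_partner, prev_own) == partner_action:
                self.hits[k] += 1


def simulate(partner_order, rounds=10, adaptive=True, fixed_order=0):
    """Return the number of rounds in which both agents chose the same action."""
    agent = AdaptiveToMAgent()
    own_last, partner_last = 0, 1  # assumed start: the agents are miscoordinated
    matches = 0
    for _ in range(rounds):
        if adaptive:
            own = agent.act(own_last, partner_last)
        else:
            own = tom_action(fixed_order, own_last, partner_last)
        partner = tom_action(partner_order, partner_last, own_last)
        agent.update(own_last, partner_last, partner)
        matches += int(own == partner)
        own_last, partner_last = own, partner
    return matches


if __name__ == "__main__":
    print("fixed order 0 vs partner order 0:", simulate(0, adaptive=False))
    print("adaptive vs partner order 0:", simulate(0))
    print("adaptive vs partner order 1:", simulate(1))
```

In this toy, a fixed zero-order agent paired with a zero-order partner keeps swapping actions and never coordinates, while the adaptive agent coordinates with either partner type after at most one missed round. The hit-counting estimator is a stand-in chosen for brevity; the paper's actual estimation procedure is not described in this listing.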

Related benchmarks

Task                      Dataset                                      Result            Rank
Coordination              Memory-1                                     Point Score 75    14
Coordination              Memory-N                                     Point Score 75    14
Coordination              Game 1                                       Time 5.87         14
Coordination              Game 2                                       Time 7            14
Coordination              Overcooked                                   Time 43.53        14
Multi-agent coordination  Overcooked cross-play official AI library    Time 48.05        7
