Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Theory of Mind for Multi-Agent Collaboration via Large Language Models

About

While Large Language Models (LLMs) have demonstrated impressive accomplishments in both reasoning and planning, their abilities in multi-agent collaborations remains largely unexplored. This study evaluates LLM-based agents in a multi-agent cooperative text game with Theory of Mind (ToM) inference tasks, comparing their performance with Multi-Agent Reinforcement Learning (MARL) and planning-based baselines. We observed evidence of emergent collaborative behaviors and high-order Theory of Mind capabilities among LLM-based agents. Our results reveal limitations in LLM-based agents' planning optimization due to systematic failures in managing long-horizon contexts and hallucination about the task state. We explore the use of explicit belief state representations to mitigate these issues, finding that it enhances task performance and the accuracy of ToM inferences for LLM-based agents.

Huao Li, Yu Quan Chong, Simon Stepputtis, Joseph Campbell, Dana Hughes, Michael Lewis, Katia Sycara• 2023

Related benchmarks

TaskDatasetResultRank
Cooperative Adaptive Cruise ControlCACC Slow-down scenario
RMSE (Horizontal)0.641
7
Pandemic ControlNew York topology
$I_n$ (Infection Rate)0.13
7
Pandemic ControlHong Kong topology
Infection Status (I_n)23.9
7
Pandemic ControlHelsinki topology
Infection Rate (I_n)18.4
7
Cooperative Adaptive Cruise ControlCACC Catch-up scenario
RMSE (Lateral)5.514
7
Ad-hoc teamworkUSAR
Steps15.9
5
Ad-hoc teamworkPredator Prey pp_v0
Steps11.6
5
Ad-hoc teamworkPredator Prey v1
Steps6.8
5
Showing 8 of 8 rows

Other info

Follow for update