Theory of Mind for Multi-Agent Collaboration via Large Language Models

About

While Large Language Models (LLMs) have demonstrated impressive accomplishments in both reasoning and planning, their abilities in multi-agent collaborations remains largely unexplored. This study evaluates LLM-based agents in a multi-agent cooperative text game with Theory of Mind (ToM) inference tasks, comparing their performance with Multi-Agent Reinforcement Learning (MARL) and planning-based baselines. We observed evidence of emergent collaborative behaviors and high-order Theory of Mind capabilities among LLM-based agents. Our results reveal limitations in LLM-based agents' planning optimization due to systematic failures in managing long-horizon contexts and hallucination about the task state. We explore the use of explicit belief state representations to mitigate these issues, finding that it enhances task performance and the accuracy of ToM inferences for LLM-based agents.

Huao Li, Yu Quan Chong, Simon Stepputtis, Joseph Campbell, Dana Hughes, Michael Lewis, Katia Sycara• 2023

Related benchmarks

Task	Dataset	Result
Cooperative Adaptive Cruise Control	CACC Slow-down scenario	RMSE (Horizontal)0.641	7
Pandemic Control	New York topology	$I_n$ (Infection Rate)0.13	7
Pandemic Control	Hong Kong topology	Infection Status (I_n)23.9	7
Pandemic Control	Helsinki topology	Infection Rate (I_n)18.4	7
Cooperative Adaptive Cruise Control	CACC Catch-up scenario	RMSE (Lateral)5.514	7
Ad-hoc teamwork	USAR	Steps15.9	5
Ad-hoc teamwork	Predator Prey pp_v0	Steps11.6	5
Ad-hoc teamwork	Predator Prey v1	Steps6.8	5

Showing 8 of 8 rows

Other info

Follow for update

@wizwand_team Discord