
CooT: Learning to Coordinate In-Context with Coordination Transformers

About

Effective coordination among unfamiliar partners remains a major challenge in multi-agent systems. Existing approaches, such as population-based methods, improve robustness through diversity but often lack mechanisms for efficient adaptation beyond the training distribution. Moreover, fine-tuning is impractical in few-shot settings because of its high interaction cost. To address these limitations, we propose CooT, a framework that leverages in-context learning (ICL) for real-time partner adaptation. Unlike prior ICL approaches that focus on task generalization, CooT is designed to generalize across diverse partner behaviors. Trained on trajectories from behavior-preferring agents, it learns to align its actions with partner intentions purely through observation. We evaluate CooT on two challenging multi-agent benchmarks: Overcooked and Google Research Football. Results show that CooT consistently outperforms population-based methods, gradient-based fine-tuning, and meta-RL baselines, achieving stable and rapid adaptation without parameter updates. Human evaluations also identify CooT as a preferred collaborator. Our ablations confirm that it adapts quickly to new partners and remains stable under sudden partner changes, making it reliable for real-world human-AI collaboration.

Huai-Chih Wang, Hsiang-Chun Chuang, Hsi-Chun Cheng, Dai-Jie Wu, Shao-Hua Sun • 2025

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
| --- | --- | --- | --- | --- |
| Multi-agent coordination | Overcooked Coord. Ring | Average Return | 38.3 | 10 |
| Multi-agent coordination | Overcooked Coord. Ring Multi-recipe | Return | 45.96 | 8 |
| Multi-agent coordination | Overcooked Counter Circ. | Return | 28.28 | 8 |
| Multi-agent coordination | Overcooked Bothway Coord. | Return | 101.9 | 8 |
| Multi-agent coordination | Overcooked Overall | Return | 68.79 | 8 |
| Multi-agent coordination | Overcooked Asymm. Adv. | Return | 129.5 | 8 |
| Multi-agent coordination | Google Research Football (GRF) 3-vs-1 with keeper | Goal Rate | 2.5 | 5 |
| Human-AI Coordination | Overcooked Coord. Ring layout (Human Study) | Return | 63.5 | 4 |
| Coordination | Counter Circ. layout (test) | Coordination Score (Ep 1) | 21.33 | 2 |
| Coordination | Overcooked Coord. Ring | Episode 1 Score | 29.63 | 2 |
