Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Learning Multi-Agent Communication from Graph Modeling Perspective

About

In numerous artificial intelligence applications, the collaborative efforts of multiple intelligent agents are imperative for the successful attainment of target objectives. To enhance coordination among these agents, a distributed communication framework is often employed. However, information sharing among all agents proves to be resource-intensive, while the adoption of a manually pre-defined communication architecture imposes limitations on inter-agent communication, thereby constraining the potential for collaborative efforts. In this study, we introduce a novel approach wherein we conceptualize the communication architecture among agents as a learnable graph. We formulate this problem as the task of determining the communication graph while enabling the architecture parameters to update normally, thus necessitating a bi-level optimization process. Utilizing continuous relaxation of the graph representation and incorporating attention units, our proposed approach, CommFormer, efficiently optimizes the communication graph and concurrently refines architectural parameters through gradient descent in an end-to-end manner. Extensive experiments on a variety of cooperative tasks substantiate the robustness of our model across diverse cooperative scenarios, where agents are able to develop more coordinated and sophisticated strategies regardless of changes in the number of agents.

Shengchao Hu, Li Shen, Ya Zhang, Dacheng Tao• 2024

Related benchmarks

TaskDatasetResultRank
Multitask Language UnderstandingMMLU
Accuracy82.35
520
Mathematical ReasoningSVAMP
Accuracy84.01
403
Arithmetic ReasoningMultiArith
Accuracy94.53
293
Math ReasoningAQUA
Accuracy72.95
188
Multi-Agent Reinforcement LearningSMAC 3s5z v1 (test)
Win Rate91.2
13
Multi-agent coordinationSMAC Zerg 5v5 v2
Median Win Rate39
10
Multi-agent coordinationSMAC Terran 5v5 v2
Median Win Rate30
10
Multi-Agent Reinforcement LearningSMAC 25m v1 (test)
Test Win Rate98.3
9
Multi-agent BattleMAgent Battle n=36 agents 3M training steps (test)
Final Return2.5
9
Multi-agent coordinationSMAC Zerg 5v6 v2
Median Win Rate16
9
Showing 10 of 29 rows

Other info

Follow for update