Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multi-Agent Game Abstraction via Graph Attention Neural Network

About

In large-scale multi-agent systems, the large number of agents and complex game relationship cause great difficulty for policy learning. Therefore, simplifying the learning process is an important research issue. In many multi-agent systems, the interactions between agents often happen locally, which means that agents neither need to coordinate with all other agents nor need to coordinate with others all the time. Traditional methods attempt to use pre-defined rules to capture the interaction relationship between agents. However, the methods cannot be directly used in a large-scale environment due to the difficulty of transforming the complex interactions between agents into rules. In this paper, we model the relationship between agents by a complete graph and propose a novel game abstraction mechanism based on two-stage attention network (G2ANet), which can indicate whether there is an interaction between two agents and the importance of the interaction. We integrate this detection mechanism into graph neural network-based multi-agent reinforcement learning for conducting game abstraction and propose two novel learning algorithms GA-Comm and GA-AC. We conduct experiments in Traffic Junction and Predator-Prey. The results indicate that the proposed methods can simplify the learning process and meanwhile get better asymptotic performance compared with state-of-the-art algorithms.

Yong Liu, Weixun Wang, Yujing Hu, Jianye Hao, Xingguo Chen, Yang Gao• 2019

Related benchmarks

TaskDatasetResultRank
Cooperative NavigationCooperative Navigation easy
Mean Episode Reward2.24
14
Cooperative NavigationCN MPE medium
Mean Episode Reward2.57
7
Cooperative NavigationCN MPE super_hard
Mean Episode Reward3.01
7
Multi-agent cooperationSMAC 1o_10b_vs_1r (easy)
Win Rate38.18
7
Predator-PreyPredator Prey easy
Mean Episode Reward-1.17
7
Cooperative NavigationCooperative Navigation hard
Mean Episode Reward-2.71
7
Cooperative NavigationCN MPE hard
Mean Episode Reward2.85
7
Multi-agent cooperationSMAC 1o_2r_vs_4r (easy)
Win Rate44.27
7
Multi-agent cooperationSMAC 1o_10b_vs_1r medium
Win Rate23.13
7
Multi-agent cooperationSMAC 1o_10b_vs_1r hard
Win Rate16.12
7
Showing 10 of 19 rows

Other info

Follow for update