Multi-Agent Path Finding with Prioritized Communication Learning

About

Multi-agent pathfinding (MAPF) has been widely used to solve large-scale real-world problems, e.g., automation warehouses. The learning-based, fully decentralized framework has been introduced to alleviate real-time problems and simultaneously pursue optimal planning policy. However, existing methods might generate significantly more vertex conflicts (or collisions), which lead to a low success rate or more makespan. In this paper, we propose a PrIoritized COmmunication learning method (PICO), which incorporates the \textit{implicit} planning priorities into the communication topology within the decentralized multi-agent reinforcement learning framework. Assembling with the classic coupled planners, the implicit priority learning module can be utilized to form the dynamic communication topology, which also builds an effective collision-avoiding mechanism. PICO performs significantly better in large-scale MAPF tasks in success rates and collision rates than state-of-the-art learning-based planners.

Wenhao Li, Hongjun Chen, Bo Jin, Wenzhe Tan, Hongyuan Zha, Xiangfeng Wang• 2022

Related benchmarks

Task	Dataset	Result	Rank
Multi-robot Exploration	40x40 world, 4 Agents, 30% static obstacle density (100 runs)	Success Rate (SR)71		28
Multi-robot Exploration	80x80 world, 16 Agents, 30% static obstacle density 100 runs	Success Rate (SR)52		28

Showing 2 of 2 rows

Other info

Follow for update

@wizwand_team Discord