Cooperative Exploration for Multi-Agent Deep Reinforcement Learning

About

Exploration is critical for good results in deep reinforcement learning and has attracted much attention. However, existing multi-agent deep reinforcement learning algorithms still use mostly noise-based techniques. Very recently, exploration methods that consider cooperation among multiple agents have been developed. However, existing methods suffer from a common challenge: agents struggle to identify states that are worth exploring, and hardly coordinate exploration efforts toward those states. To address this shortcoming, in this paper, we propose cooperative multi-agent exploration (CMAE): agents share a common goal while exploring. The goal is selected from multiple projected state spaces via a normalized entropy-based technique. Then, agents are trained to reach this goal in a coordinated manner. We demonstrate that CMAE consistently outperforms baselines on various tasks, including a sparse-reward version of the multiple-particle environment (MPE) and the Starcraft multi-agent challenge (SMAC).

Iou-Jen Liu, Unnat Jain, Raymond A. Yeh, Alexander G. Schwing• 2021

Related benchmarks

Task	Dataset	Result
Exploration	MPE Push-Box	Exploration Steps (k)972.3	2
Exploration	MPE Secret-Room	Exploration Steps (k)1.45e+6	2
Exploration	MPE Pass	Exploration Steps (k)2.11e+3	2
Exploration	MPE Large-Pass	Exploration Steps (thousands)3.00e+3	2

Showing 4 of 4 rows

Other info

Follow for update

@wizwand_team Discord