Safe Multi-Agent Reinforcement Learning on Wireless Communication (25 agents)

Constraint Violation Rate: 19.26

Scalable Primal-Dual Actor-Critic

Updated 1mo ago

Evaluation Results

Method	Date	Constraint Violation Rate
Scalable Primal-Dual Actor-Critic	2023.05	19.26
MAPPO-L	2023.05	40
Decentralized Aggregate MAPPO-L	2023.05	118.9
Decentralized MAPPO-L	2023.05	157.6