Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Safe Multi-Agent Reinforcement Learning on Wireless Communication 25 agents
Loading...
19.26
Constraint Violation Rate
Scalable Primal-Dual Actor-Critic
13.7264
51.0782
88.43
125.7818
May 27, 2023
Constraint Violation Rate
Episodic Return
Updated 1mo ago
Evaluation Results
Method
Method
Links
Constraint Violation Rate
Episodic Return
Scalable Primal-Dual Actor-Critic
Algorithm=Ours
2023.05
19.26
-
MAPPO-L
Information Access=Global
2023.05
40
-
Decentralized Aggregate MAPPO-L
Information Access=Loc...
2023.05
118.9
-
Decentralized MAPPO-L
Information Access=Loc...
2023.05
157.6
-
Feedback
Search any
task
Search any
task