Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Cooperative Multi-Agent Reinforcement Learning on Disperse (last 2% of train)
Loading...
-0.36
Mean Episodic Reward
DICG
-4.7176
-3.5863
-2.455
-1.3237
May 8, 2026
Mean Episodic Reward
Updated 22d ago
Evaluation Results
Method
Method
Links
Mean Episodic Reward
DICG
2026.05
-0.36
SACHI
L=2, d=64, K=1
2026.05
-0.37
DGN
2026.05
-0.39
FOP
2026.05
-1.12
DCG
2026.05
-1.16
CASEC
2026.05
-2
IQL
2026.05
-2.36
MAPPO
2026.05
-2.54
QMIX
2026.05
-2.57
VDN
2026.05
-2.59
IPPO
2026.05
-2.78
QPLEX
2026.05
-2.99
QTRAN
2026.05
-4.55
Feedback
Search any
task
Search any
task