Cooperative Multi-Agent Reinforcement Learning on Adversary (last 2% of train)
[Chart: Mean Episodic Reward by date. Top point: SACHI, 85.04, May 8, 2026.]
Updated 22d ago
Evaluation Results
Method   Configuration     Date      Mean Episodic Reward
SACHI    L=2, d=64, K=1    2026.05   85.04
DGN      -                 2026.05   83.54
CASEC    -                 2026.05   81.55
QTRAN    -                 2026.05   79.79
QPLEX    -                 2026.05   65.69
FOP      -                 2026.05   44.68
DCG      -                 2026.05   40.35
DICG     -                 2026.05   36.51
IQL      -                 2026.05   33.46
MAPPO    -                 2026.05   31.83
IPPO     -                 2026.05   23.54
VDN      -                 2026.05   18.69
QMIX     -                 2026.05   10.45
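
The benchmark title says the metric is taken over the "last 2% of train", but the page does not spell out the computation. One plausible reading, offered only as a sketch, is the mean of per-episode rewards over the final 2% of training episodes. The function name `mean_reward_last_fraction`, the rounding rule, and the sample data below are all assumptions for illustration, not the benchmark's actual evaluation code.

```python
import math


def mean_reward_last_fraction(episode_rewards, fraction=0.02):
    """Average episodic reward over the final `fraction` of training episodes.

    Assumption: "last 2% of train" means the trailing 2% of episodes,
    rounded up so that at least one episode is always included.
    """
    if not episode_rewards:
        raise ValueError("episode_rewards must be non-empty")
    if not 0 < fraction <= 1:
        raise ValueError("fraction must be in (0, 1]")
    # Number of trailing episodes to average; never fewer than one.
    n_tail = max(1, math.ceil(len(episode_rewards) * fraction))
    tail = episode_rewards[-n_tail:]
    return sum(tail) / len(tail)


# Hypothetical training log: 100 episodes with rewards 0.0, 1.0, ..., 99.0.
rewards = [float(i) for i in range(100)]
print(mean_reward_last_fraction(rewards))
```

Averaging only the tail of training reports performance near convergence rather than across the whole learning curve, which is presumably why the benchmark restricts the window.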