Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Cooperative Multi-Agent Reinforcement Learning on SMAC 8m map
Loading...
17.8
Return
KD-MARL
5.528
8.714
11.9
15.086
Apr 8, 2026
Return
Win Rate
TPS
Updated 9d ago
Evaluation Results
Method
Method
Links
Return
Win Rate
TPS
KD-MARL
Configuration=LH
2026.04
17.8
88.97
17.3
KD-MARL
Configuration=LH+A
2026.04
17.6
88.23
15.8
MAPPO
Configuration=FO
2026.04
17
89.91
21.5
QMIX
Configuration=FO
2026.04
16
92.19
21.9
VDN
Configuration=FO
2026.04
15
75.32
19
MAPPO
Configuration=LH
2026.04
14
77.82
22
QMIX
Configuration=LH
2026.04
12.5
64.78
18.1
MAPPO
Configuration=LH+A
2026.04
10
60.07
21.8
VDN
Configuration=LH
2026.04
10
52.11
17.2
QMIX
Configuration=LH+A
2026.04
8.5
48.13
10.8
VDN
Configuration=LH+A
2026.04
6
33.05
15
Feedback
Search any
task
Search any
task