
Multi-Agent Reinforcement Learning on REF-q rac-dist (test)

Mean Episodic Reward (q=2): 125

QMIX

Oct 14, 2022
Updated 4d ago

Evaluation Results

Method | Links
2022.10
1253805775.97
2022.10
1162442318.171
2022.10
1133104549.36
2022.10
822582788.467
2022.10
441713007.737
2022.10
351922277.517
2022.10
23-134027.893
2022.10
11177747.904
2022.10
1126317.324
2022.10
-50-103-612.457
2022.10
-62-210-2711.204
2022.10
-108-371-6094.608