
Offline Multi-Agent Reinforcement Learning on Bridge Optimal

Mean Return: -1.27 (AlberDICE)

[Chart: Mean Return over time; y-axis ticks at -6.1996, -4.9198, -3.64, -2.3602; data point dated Nov 3, 2023]

Evaluation Results

Date      Mean Return
2023.11   -1.27
2023.11   -1.81
2023.11   -2.21
2023.11   -2.71
2023.11   -4.31
2023.11   -6.01