Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Offline Multi-Agent Reinforcement Learning on Bridge (Mix)

-1.29Mean Return

AlberDICE

-6.7916-5.3633-3.935-2.5067Nov 3, 2023
Updated 1mo ago

Evaluation Results

MethodLinks
2023.11
-1.29
2023.11
-1.76
2023.11
-5.88
2023.11
-6.01
2023.11
-6.01
2023.11
-6.58