Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Offline Multi-agent Reinforcement Learning on Warehouse Tiny (11x11)

11.15Mean Performance (N=2)

AlberDICE

6.04367.36938.69510.0207Nov 3, 2023
Updated 4d ago

Evaluation Results

MethodLinks
2023.11
11.1513.1115.72
2023.11
9.3812.1314.59
2023.11
8.811.1214.06
2023.11
8.711.1314.02
2023.11
6.7714.3916.13
2023.11
6.249.913.06