| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Multi-agent Offline Reinforcement Learning | MPE CN (Medium-replay) | Score 52.2 | 8 |
| Multi-agent Offline Reinforcement Learning | MPE CN (Random) | Score 62.2 | 8 |
| Multi-agent Reinforcement Learning | MPE Predator-prey (PP) v1 (Expert) | Normalized Score 118.2 | 4 |
| Multi-agent Reinforcement Learning | MPE Predator-prey (PP) v1 (Med-Rep) | Normalized Score 71.1 | 4 |
| Multi-agent Reinforcement Learning | MPE Predator-prey (PP) v1 (Random) | Normalized Score 78.5 | 4 |
| Multi-agent Reinforcement Learning | MPE Cooperative Navigation (CN) v1 (Expert) | Normalized Score 114.9 | 4 |