| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Keep-Away (2v2) | Multi-agent particle environment (MPE) (test) | Mean Episode Extrinsic Reward52.14 | 7 | |
| Predator-Prey (2v2) | Multi-agent particle environment (MPE) (test) | Mean Episode Extrinsic Reward-0.77 | 7 | |
| Physical Deception (2v1) | Multi-agent particle environment (MPE) (test) | Mean Extrinsic Reward101.72 | 7 | |
| Heterogeneous Navigation (4v0) | Multi-agent particle environment (MPE) (test) | Mean Episode Extrinsic Reward311.67 | 6 | |
| Cooperative Navigation (3v0) | Multi-agent particle environment (MPE) (test) | Mean Episode Extrinsic Reward155.88 | 6 |