| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Sim2Real Regression | Predator-Prey Real | Context Likelihood2.451 | 16 | |
| Sim2Real Regression | Predator-Prey Simulation | Context Likelihood271.1 | 16 | |
| Predator Prey | Predator Prey super_hard | Mean Episode Reward-0.97 | 7 | |
| Predator Prey | Predator Prey hard | Mean Episode Reward-1.02 | 7 | |
| Predator Prey | Predator Prey medium | Mean Episode Reward-0.91 | 7 | |
| Predator Prey | Predator Prey easy | Mean Episode Reward-0.93 | 7 | |
| Trajectory Forecasting | Predator-prey data T=100 (test) | Test MSE0.74 | 6 | |
| Multi-agent Coordination | Predator-Prey | IQM Return2.547 | 5 | |
| Ad-hoc teamwork | Predator Prey pp_v0 | Steps10.9 | 5 | |
| Ad-hoc teamwork | Predator Prey v1 | Steps10.6 | 5 | |
| Causal Discovery | Predator-Prey | Causal Strength5 | 4 | |
| ODE discovery | Predator-prey Lynx-Hares | MP1 | 4 | |
| Factor of Variation Prediction | Predator-Prey (PP) (test) | R20.39 | 2 | |
| End-to-end learning and planning | Predator Prey | Cost (Best)1.79 | 1 | |
| Q-only open-loop forecasting | Predator-prey damped UNKNOWN regime (test) | Metric- | 0 | |
| Communication Alignment Similarity | Predator Prey v0 | Metric- | 0 | |
| Communication Alignment Similarity | Predator Prey v1 | Metric- | 0 |