| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Multi-Agent Path Finding (MAPF) | warehouse 161x63 | Success Rate100 | 31 | |
| Multi-Task Reinforcement Learning (LTL Instruction Following) | Warehouse Finite Horizon | Success Rate99 | 30 | |
| Multi-Agent Path Finding | Medium Warehouse 25x25 world size, 34.6% static obstacle rate | Success Rate100 | 20 | |
| Multi-Task Reinforcement Learning (LTL Instruction Following) | Warehouse Infinite Horizon | Average Visits880.6 | 20 | |
| Target Assignment and Pathfinding (TAPF) | warehouse 10-20-10-2-1 | Success Rate100 | 14 | |
| Offline Multi-agent Reinforcement Learning | Warehouse Small (11x20) | Mean Performance (N=2)5.97 | 6 | |
| Offline Multi-agent Reinforcement Learning | Warehouse Tiny (11x11) | Mean Performance (N=2)11.15 | 6 | |
| Interval Quality | Warehouse | Hit Rate50.2 | 5 | |
| Feasibility Prediction | Warehouse | F1@142 | 5 | |
| Task Performance | Warehouse | Turns Taken12 | 5 | |
| Exploration efficiency | Warehouse Mini | CE2.66 | 5 | |
| Exploration efficiency | Small Warehouse | CE1.08 | 5 | |
| Warehouse | Warehouse H3 | Mean Episode Reward268.9 | 5 | |
| Warehouse | Warehouse H2 | Mean Episode Reward269.7 | 5 | |
| Warehouse | Warehouse H1 | Mean Episode Reward270.8 | 5 | |
| Warehouse | Warehouse H0 | Mean Episode Reward269.5 | 5 | |
| Autonomous Exploration | Warehouse 1260m3 | Exploration Duration41.42 | 5 | |
| Short-term Navigation | Warehouse 2.5D | Average Prompts2.86 | 5 | |
| Mobile Manipulation | Warehouse Cross-room transfer v1 | AIKF14.3 | 5 | |
| Robotic Exploration | Warehouse Gazebo simulation (test) | Distance (m)86,942 | 5 | |
| Autonomous Robotic Exploration | Warehouse Gazebo simulation (environment) | Exploration Distance (m)869 | 5 | |
| Multi-agent coordination | Warehouse (uneven) size S | Success Rate100 | 5 | |
| Multi-agent coordination | Warehouse (even) size S | Success Rate1 | 5 | |
| Multi-agent coordination | Warehouse (WH) | Base Score473.65 | 4 | |
| Autonomous Exploration | Mini warehouse Simulated | Success Rate100 | 4 |