| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Simulation Task Planning | BEHAVIOR-1K 15 tasks | BT Valid100 | 14 | |
| Progress Estimation | Behavior | MRA87.08 | 12 | |
| Long-Horizon Household Tasks | BEHAVIOR-1K | Fitting44.7 | 12 | |
| Visual Planning | MINIBEHAVIOR | EM7,580 | 8 | |
| Autonomous Driving | Behavior Shifted Environment (test) | Testing Reward1.02 | 8 | |
| Robot Learning | BEHAVIOR 2025 (private) | Binary Success12.4 | 5 | |
| Robot Learning | BEHAVIOR 2025 (public) | Binary Success14.4 | 5 | |
| Household Planning | Behavior-1K | Success Rate84.4 | 5 | |
| Pick up Soda Can | BEHAVIOR | Navigational Success Rate84 | 3 | |
| Pick up Radio | BEHAVIOR | Navigation Success Rate88 | 3 | |
| ADS Testing | Behavior | Execution Time (s)43.66 | 3 | |
| Motion Planning | BEHAVIOR Franka MM (test) | Motion Completion Time (sec)5.03 | 3 | |
| Motion Planning | BEHAVIOR HSR 1488 samples (test) | Motion Completion Time (sec)5.01 | 3 |