| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Long-Horizon Household Tasks | BEHAVIOR-1K | Fitting: 44.7 | 12 |
| Visual Planning | MINIBEHAVIOR | EM: 7,580 | 8 |
| Autonomous Driving | Behavior Shifted Environment (test) | Testing Reward: 1.02 | 8 |
| Robot Learning | BEHAVIOR 2025 (private) | Binary Success: 12.4 | 5 |
| Robot Learning | BEHAVIOR 2025 (public) | Binary Success: 14.4 | 5 |
| Household Planning | Behavior-1K | Success Rate: 84.4 | 5 |
| Motion Planning | BEHAVIOR Franka MM (test) | Motion Completion Time (sec): 5.03 | 3 |
| Motion Planning | BEHAVIOR HSR 1488 samples (test) | Motion Completion Time (sec): 5.01 | 3 |