| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Robotic Task Perception | RoboFAC real-robot | VOC Success Rate93.01 | 8 | |
| Robot Failure Analysis (MCQ) | RoboFAC (Real-world) | FD96 | 7 | |
| Robot Failure Analysis (MCQ) | RoboFAC Simulation | FD Score93 | 7 | |
| Robotic Failure Analysis | RoboFAC 1.0 (mixed simulated and real-world) | Task Success Rate (Short Horizon)82.74 | 6 | |
| Free-language reasoning | RoboFAC (Real-world) | ROUGE-L (TI)33.8 | 4 | |
| Free-language reasoning | RoboFAC Simulation | ROUGE-L (TI)32.6 | 4 |