| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Robotic Manipulation | RoboTwin 2.0 | Average Success Rate88 | 64 | |
| Robotic Manipulation | RoboTwin 1.0 | Success Rate100 | 48 | |
| Robot Manipulation | RoboTwin Clean 2.0 | Place Dual Shoes Success96 | 24 | |
| Robotic Manipulation | RoboTwin 2.0 (test) | Average Success Rate94.3 | 22 | |
| Robot Manipulation | RoboTwin Randomized 2.0 | Success Rate: Place Dual Shoes96 | 20 | |
| Robotic Manipulation | RoboTwin | Success Rate80.7 | 13 | |
| Progress Estimation | RoboTwin | MRA31.84 | 12 | |
| Robotic Manipulation | RoboTwin Easy 2.0 | Adjust Bottle Success Rate97 | 11 | |
| Robotic Manipulation | RoboTwin Hard 2.0 | Overall Success Rate92.1 | 9 | |
| Jailbreak Attack | RoboTwin Motus | MFR2.4 | 8 | |
| Jailbreak Attack | RoboTwin LingBot-VA | Misclassification Failure Rate (MFR)1.6 | 8 | |
| Embodied Task Planning | RoboTwin Library Scene | TEI1.32 | 8 | |
| Embodied Task Planning | RoboTwin Pet Shop Scene | TEI1.36 | 8 | |
| Embodied Task Planning | RoboTwin Hotel Scene | TEI1.12 | 8 | |
| Embodied AI Task Planning | RoboTwin Disaster Rescue Scene (test) | TEI1.35 | 8 | |
| Embodied AI Task Planning | RoboTwin Hospital Scene (test) | TEI1.29 | 8 | |
| Embodied AI Task Planning | RoboTwin Supermarket Scene (test) | TEI1.34 | 8 | |
| Robotic Manipulation | RoboTwin simulation 1.0 | Hammer Success Rate89 | 8 | |
| Robot manipulation | RoboTwin Hard 2.0 | Beat Block Hammer Success Rate42 | 8 | |
| Robotic Manipulation | RoboTwin | Block Hammer Beat Success Rate77 | 7 | |
| Robotic Task Execution | RoboTwin easy-mode 2.0 (evaluation) | Average Execution Duration (s)27 | 7 | |
| Robotic Manipulation | RoboTwin easy-mode 2.0 (evaluation) | Adjust Bottle Success98 | 7 | |
| High-level Planning | RoboTwin Hard | TEI2.4 | 7 | |
| High-level Planning | RoboTwin Medium | Task Execution Index (TEI)2.1 | 7 | |
| High-level Planning | RoboTwin Easy | Task Execution Index (TEI)10.6 | 7 |