| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| CALVIN ABC→D Zero-shot | FOFPred | Task 1 Success Rate98.8 | 16 | 4d ago | |
| Long-horizon tasks (test) | MIND-V | PFC Score0.445 | 6 | 4d ago | |
| RLBench | CoT-VLA | Pick Cup Success Rate86 | 5 | 4d ago | |
| Real-World Execution | Action-Sketcher | Tidy Table Success Rate52 | 4 | 4d ago | |
| RoboTwin Simulation 2.0 | Action-Sketcher | Stack Blocks34.5 | 4 | 4d ago | |
| AIRBOT Play real-world | CLOVER | Sub-task 1 Success Rate93.3 | 4 | 4d ago | |
| SayCan Kitchen1 | SayCan w/ Gato | Planning Success Rate87 | 4 | 4d ago | |
| Real-world Unseen Lighting | PALM | Success Rate (Step 1)80 | 3 | 4d ago | |
| Real-world (Visual Distraction) | PALM | Success Rate (Step 1)85 | 3 | 4d ago | |
| Real-world Random Localization | PALM | Success Rate (Step 1)70 | 3 | 4d ago | |
| SayCan Kitchen2 | SayCan w/ Gato | Planning Success Rate0.87 | 3 | 4d ago |