| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| LIBERO | EO1 | Spatial Success Rate99.7 | 527 | 5d ago | |
| LIBERO-Plus | OpenVLA-OFT | Language Understanding Score99 | 249 | 2d ago | |
| CALVIN ABCD->D | MCIL | Avg Length0.4 | 130 | 2d ago | |
| RoboTwin 2.0 | SANTS | Average Success Rate94.4 | 100 | 6d ago | |
| LIBERO-Long | FULLSOFT | Success Rate96.24 | 91 | 1d ago | |
| LIBERO v1 (test) | ElasticFlow | Average Success Rate98.5 | 83 | 1d ago | |
| LIBERO Spatial Object Goal Long | Agentic-VLA | Overall Success Rate (Long)98.1 | 82 | 8d ago | |
| Calvin ABC-D | CoVAR | Task-1 Score100 | 71 | 21d ago | |
| RLBench | MALLVi | Place Cups Success96 | 63 | 27d ago | |
| LIBERO (test) | PriorVLA | Object Success Rate99.8 | 58 | 21d ago | |
| LIBERO 1.0 (test) | QuoVLA | Long98.7 | 57 | 8d ago | |
| LIBERO-10 | AWP | Success Rate96 | 54 | 21d ago | |
| LIBERO | Spatial Forcing | Spatial Success Rate99.4 | 52 | 2d ago | |
| SIMPLER Visual Matching WidowX robot | LoLA | Put Spoon on Towel Score95.8 | 51 | 1mo ago | |
| RLBench (test) | TGM-VLA | Average Success Rate90.5 | 49 | 2mo ago | |
| RoboTwin 1.0 | DP+HiPolicy | Success Rate100 | 48 | 1mo ago | |
| SimplerEnv | OneVLA | Success Rate: Spoon on Towel87.5 | 42 | 1d ago | |
| LIBERO-Goal | BASE | Success Rate96.8 | 42 | 1d ago | |
| CALVIN D→D | MDT-V | Average Length4.52 | 40 | 1mo ago | |
| RoboCasa | Consistency-Exploring | Average Success Rate68 | 39 | 21d ago | |
| Franka-Kitchen | BYOL | Avg Success Rate93.75 | 39 | 1mo ago | |
| RoboTwin (random-scene) | NoiseGate | Success Rate100 | 36 | 23d ago | |
| SIMPLER Google Robot VA | DAM-VLA | Pick Up Coke Can Success Rate98 | 35 | 2mo ago | |
| LIBERO-PLUS (test) | Libra-VLA | Language Robustness Score92.7 | 32 | 8d ago | |
| SIMPLER Visual Matching | Hard-TIES | Average Success Rate78.1 | 31 | 5d ago |