SkiP: When to Skip and When to Refine for Efficient Robot Manipulation
About
Previous imitation learning policies predict future actions at every control step, whether in smooth motion phases or precise, contact-rich operation phases. This uniform treatment is wasteful: most steps in a manipulation trajectory traverse free space and carry little task-relevant information, while a small fraction of \emph{key} steps around contacts, grasps, and alignment demand dense, high-resolution prediction. We propose a novel \emph{action relabeling} mechanism: at each timestep in a skip segment, we replace the behavior cloning target with the action at the entrance of the next key segment, enabling the policy to leap over redundant steps in a single decision. The resulting \textbf{Skip Policy (SkiP)} dynamically leaps over skip segments and intensively refines actions in key segments, within a single unified network requiring no learned skip planner or hierarchical structure. To automatically partition demonstrations into key and skip segments without manual annotation, we introduce \emph{Motion Spectrum Keying} (MSK), a fast, task-agnostic procedure that detects local motion complexity from action signals. Extensive experiments across 72 simulated manipulation tasks and three real-robot tasks show that SkiP reduces executed steps by $15$--$40\%$ while matching or improving success rates across various policy backbones. Project page: \texttt{https://pgq18.github.io/SkiP-page/}.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Robot Manipulation | RLBench 10 | Success Rate100 | 20 | |
| Bimanual Robot Manipulation | RoboTwin 2.0 | Success Rate: Handover Block87 | 14 | |
| Robot Manipulation | Robomimic image observation | -- | 9 | |
| Pour Water | Real-robot tabletop tasks 15 rollouts | Success Rate46.7 | 4 | |
| Stack bowls | Real-robot tabletop tasks 15 rollouts | Success Rate53.3 | 4 | |
| tidy-up-desk | Real-robot tabletop tasks 15 rollouts | Success Rate (%)73.3 | 4 | |
| Robot Manipulation: close_jar | RLBench-18 1.0 (fine-tuning) | Success Rate (SR)42.67 | 2 | |
| Robot Manipulation: meat_off_grill | RLBench 18 (fine-tuning) | Success Rate (SR)72 | 2 | |
| Robot Manipulation: Overall (avg) | RLBench 18 (fine-tuning) | Success Rate (SR)20.96 | 2 | |
| Robot Manipulation: put_money_in_safe | RLBench 18 (fine-tuning) | Success Rate36 | 2 |