Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

One-Step Flow Policy: Self-Distillation for Fast Visuomotor Policies

About

Generative flow and diffusion models provide the continuous, multimodal action distributions needed for high-precision robotic policies. However, their reliance on iterative sampling introduces severe inference latency, degrading control frequency and harming performance in time-sensitive manipulation. To address this problem, we propose the One-Step Flow Policy (OFP), a from-scratch self-distillation framework for high-fidelity, single-step action generation without a pre-trained teacher. OFP unifies a self-consistency loss to enforce coherent transport across time intervals, and a self-guided regularization to sharpen predictions toward high-density expert modes. In addition, a warm-start mechanism leverages temporal action correlations to minimize the generative transport distance. Evaluations across 56 diverse simulated manipulation tasks demonstrate that a one-step OFP achieves state-of-the-art results, outperforming 100-step diffusion and flow policies while accelerating action generation by over $100\times$. We further integrate OFP into the $\pi_{0.5}$ model on RoboTwin 2.0, where one-step OFP surpasses the original 10-step policy. These results establish OFP as a practical, scalable solution for highly accurate and low-latency robot control.

Shaolong Li, Lichao Sun, Yongchao Chen• 2026

Related benchmarks

TaskDatasetResultRank
Robotic ManipulationRoboTwin 2.0 (test)
Average Success Rate94.3
30
3D pointcloud manipulationMetaWorld
Success Rate (Easy)87.9
30
Robotic ManipulationDexArt
Success Rate (Bucket)39
29
Robot ManipulationMetaWorld, Adroit, and Dexart Combined
Average Success Rate71.6
25
Dexterous ManipulationAdroit
Hammer Success100
17
Tool-based ManipulationDexArt
DexArt Avg Success Rate64.3
11
2D image-based manipulationAdroit & DexArt 2D image-based manipulation
Door Success Rate68.3
8
3D pointcloud manipulationAdroit
Success Rate85
8
Place empty cupRoboTwin 2.0 (test)
Success Rate93.7
8
Adjust BottleRoboTwin 2.0 (test)
Success Rate99
5
Showing 10 of 11 rows

Other info

Follow for update