Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Latent Policy Barrier: Learning Robust Visuomotor Policies by Staying In-Distribution

About

Visuomotor policies trained via behavior cloning are vulnerable to covariate shift, where small deviations from expert trajectories can compound into failure. Common strategies to mitigate this issue involve expanding the training distribution through human-in-the-loop corrections or synthetic data augmentation. However, these approaches are often labor-intensive, rely on strong task assumptions, or compromise the quality of imitation. We introduce Latent Policy Barrier, a framework for robust visuomotor policy learning. Inspired by Control Barrier Functions, LPB treats the latent embeddings of expert demonstrations as an implicit barrier separating safe, in-distribution states from unsafe, out-of-distribution (OOD) ones. Our approach decouples the role of precise expert imitation and OOD recovery into two separate modules: a base diffusion policy solely on expert data, and a dynamics model trained on both expert and suboptimal policy rollout data. At inference time, the dynamics model predicts future latent states and optimizes them to stay within the expert distribution. Both simulated and real-world experiments show that LPB improves both policy robustness and data efficiency, enabling reliable manipulation from limited expert data and without additional human correction or annotation.

Zhanyi Sun, Shuran Song• 2025

Related benchmarks

TaskDatasetResultRank
Cucumber PeelingCucumber Peeling 50 Demos, Bi-Arx5 Dual-arm 1.0 (test)
Task Score81.5
13
Chip HandoverChip Handover 50 Demos Bi-Arx5 Dual-arm 1.0 (test)
Success Rate30
13
Multi-task Performance AggregationCombined Five Tasks (Shoe Lacing, Chip Handover, Cucum. Peeling, Vase Wiping, Lock Opening) 1.0 (average)
Average Performance36.2
13
Lock OpeningLock Opening 20 Demos Flexiv Rizon4 Single-arm 1.0 (test)
Success Rate20
13
Shoe LacingShoe Lacing 100 Demos, Bi-Arx5 Dual-arm 1.0 (test)
Success Rate10
13
Vase WipingVase Wiping 30 Demos Flexiv Rizon4 Single-arm 1.0 (test)
Task Score39.5
13
Showing 6 of 6 rows

Other info

Follow for update