Latent Policy Barrier: Learning Robust Visuomotor Policies by Staying In-Distribution
About
Visuomotor policies trained via behavior cloning are vulnerable to covariate shift, where small deviations from expert trajectories can compound into failure. Common strategies to mitigate this issue involve expanding the training distribution through human-in-the-loop corrections or synthetic data augmentation. However, these approaches are often labor-intensive, rely on strong task assumptions, or compromise the quality of imitation. We introduce Latent Policy Barrier, a framework for robust visuomotor policy learning. Inspired by Control Barrier Functions, LPB treats the latent embeddings of expert demonstrations as an implicit barrier separating safe, in-distribution states from unsafe, out-of-distribution (OOD) ones. Our approach decouples the role of precise expert imitation and OOD recovery into two separate modules: a base diffusion policy solely on expert data, and a dynamics model trained on both expert and suboptimal policy rollout data. At inference time, the dynamics model predicts future latent states and optimizes them to stay within the expert distribution. Both simulated and real-world experiments show that LPB improves both policy robustness and data efficiency, enabling reliable manipulation from limited expert data and without additional human correction or annotation.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Cucumber Peeling | Cucumber Peeling 50 Demos, Bi-Arx5 Dual-arm 1.0 (test) | Task Score81.5 | 13 | |
| Chip Handover | Chip Handover 50 Demos Bi-Arx5 Dual-arm 1.0 (test) | Success Rate30 | 13 | |
| Multi-task Performance Aggregation | Combined Five Tasks (Shoe Lacing, Chip Handover, Cucum. Peeling, Vase Wiping, Lock Opening) 1.0 (average) | Average Performance36.2 | 13 | |
| Lock Opening | Lock Opening 20 Demos Flexiv Rizon4 Single-arm 1.0 (test) | Success Rate20 | 13 | |
| Shoe Lacing | Shoe Lacing 100 Demos, Bi-Arx5 Dual-arm 1.0 (test) | Success Rate10 | 13 | |
| Vase Wiping | Vase Wiping 30 Demos Flexiv Rizon4 Single-arm 1.0 (test) | Task Score39.5 | 13 |