Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Referring-Aware Visuomotor Policy Learning for Closed-Loop Manipulation

About

This paper addresses a fundamental problem of visuomotor policy learning for robotic manipulation: how to enhance robustness in out-of-distribution execution errors or dynamically re-routing trajectories, where the model relies solely on the original expert demonstrations for training. We introduce the Referring-Aware Visuomotor Policy (ReV), a closed-loop framework that can adapt to unforeseen circumstances by instantly incorporating sparse referring points provided by a human or a high-level reasoning planner. Specifically, ReV leverages the coupled diffusion heads to preserve standard task execution patterns while seamlessly integrating sparse referring via a trajectory-steering strategy. Upon receiving a specific referring point, the global diffusion head firstly generates a sequence of globally consistent yet temporally sparse action anchors, while identifies the precise temporal position for the referring point within this sequence. Subsequently, the local diffusion head adaptively interpolates adjacent anchors based on the current temporal position for specific tasks. This closed-loop process repeats at every execution step, enabling real-time trajectory replanning in response to dynamic changes in the scene. In practice, rather than relying on elaborate annotations, ReV is trained only by applying targeted perturbations to expert demonstrations. Without any additional data or fine-tuning scheme, ReV achieve higher success rates across challenging simulated and real-world tasks.

Jiahua Ma, Yiran Qin, Xin Wen, Yixiong Li, Yuyu Sun, Yulan Guo, Liang Lin, Ruimao Zhang• 2026

Related benchmarks

TaskDatasetResultRank
Robot ManipulationAdroit
Pen Task Score73
50
Robotic ManipulationDexArt
Success Rate (Laptop)87
12
Camera Alignment-via manipulationRoboFactory modified simulated benchmark
RePR100
9
Lift Barrier-via manipulationRoboFactory modified simulated benchmark
RePR100
9
Pick Meat-via manipulationRoboFactory modified simulated benchmark
Reaching Precision Rate100
9
Place Food-via manipulationRoboFactory modified simulated benchmark
Reaching Precision Rate100
9
Robotic ManipulationMetaWorld
Reach Success Rate36
4
Robotic ManipulationRoboFactory
Pick Meat Success Rate94
4
Collecting Objects-viaReal-world (30-trial)
PeRP (Success Count)30
3
Grabbing Rod-viaReal-world (30-trial)
PeRP Success Count30
3
Showing 10 of 13 rows

Other info

Follow for update