Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Human-Robot Copilot for Data-Efficient Imitation Learning

About

Collecting human demonstrations via teleoperation is a common approach for teaching robots task-specific skills. However, when only a limited number of demonstrations are available, policies are prone to entering out-of-distribution (OOD) states due to compounding errors or environmental stochasticity. Existing interactive imitation learning or human-in-the-loop methods try to address this issue by following the Human-Gated DAgger (HG-DAgger) paradigm, an approach that augments demonstrations through selective human intervention during policy execution. Nevertheless, these approaches struggle to balance dexterity and generality: they either provide fine-grained corrections but are limited to specific kinematic structures, or achieve generality at the cost of precise control. To overcome this limitation, we propose the Human-Robot Copilot framework that can leverage a scaling factor for dexterous teleoperation while maintaining compatibility with a wide range of industrial and research manipulators. Experimental results demonstrate that our framework achieves higher performance with the same number of demonstration trajectories. Moreover, since corrective interventions are required only intermittently, the overall data collection process is more efficient and less time-consuming.

Rui Yan, Zaitian Gongye, Lars Paulsen, Xuxin Cheng, Xiaolong Wang• 2026

Related benchmarks

TaskDatasetResultRank
Sort objectReal-world
Success Rate (S1)78
5
Tower of Hanoi InsertionReal-world
S1 Success Rate90
5
Nut AssemblyRobomimic Simulation Nut Assembly
Stage 1 Success Rate56
3
Can Pick & PlaceRobomimic Simulation Can Pick & Place
Stage 1 Success Rate84
3
Showing 4 of 4 rows

Other info

Follow for update