Human-Robot Copilot for Data-Efficient Imitation Learning

About

Collecting human demonstrations via teleoperation is a common approach for teaching robots task-specific skills. However, when only a limited number of demonstrations are available, policies are prone to entering out-of-distribution (OOD) states due to compounding errors or environmental stochasticity. Existing interactive imitation learning or human-in-the-loop methods try to address this issue by following the Human-Gated DAgger (HG-DAgger) paradigm, an approach that augments demonstrations through selective human intervention during policy execution. Nevertheless, these approaches struggle to balance dexterity and generality: they either provide fine-grained corrections but are limited to specific kinematic structures, or achieve generality at the cost of precise control. To overcome this limitation, we propose the Human-Robot Copilot framework that can leverage a scaling factor for dexterous teleoperation while maintaining compatibility with a wide range of industrial and research manipulators. Experimental results demonstrate that our framework achieves higher performance with the same number of demonstration trajectories. Moreover, since corrective interventions are required only intermittently, the overall data collection process is more efficient and less time-consuming.

Rui Yan, Zaitian Gongye, Lars Paulsen, Xuxin Cheng, Xiaolong Wang• 2026

Related benchmarks

Task	Dataset	Result
Sort object	Real-world	Success Rate (S1)78	5
Tower of Hanoi Insertion	Real-world	S1 Success Rate90	5
Nut Assembly	Robomimic Simulation Nut Assembly	Stage 1 Success Rate56	3
Can Pick & Place	Robomimic Simulation Can Pick & Place	Stage 1 Success Rate84	3

Showing 4 of 4 rows

Other info

Follow for update

@wizwand_team Discord