Adaptor: Advancing Assistive Teleoperation with Few-Shot Learning and Cross-Operator Generalization
About
Assistive teleoperation enhances efficiency via shared control, yet inter-operator variability, stemming from diverse habits and expertise, induces highly heterogeneous trajectory distributions that undermine intent recognition stability. We present Adaptor, a few-shot framework for robust cross-operator intent recognition. The Adaptor bridges the domain gap through two stages: (i) preprocessing, which models intent uncertainty by synthesizing trajectory perturbations via noise injection and performs geometry-aware keyframe extraction; and (ii) policy learning, which encodes the processed trajectories with an Intention Expert and fuses them with the pre-trained vision-language model context to condition an Action Expert for action generation. Experiments on real-world and simulated benchmarks demonstrate that Adaptor achieves state-of-the-art performance, improving success rates and efficiency over baselines. Moreover, the method exhibits low variance across operators with varying expertise, demonstrating robust cross-operator generalization.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Cube Stacking | Realman Cube Stacking | Success Rate86.07 | 3 | |
| Cube Transfer | ALOHA simulator Cube Transfer | Success Rate91.48 | 3 | |
| Insertion | ALOHA simulator Insertion | Success Rate67.25 | 3 | |
| Pen Organization | Realman Pen Organization | Success Rate90.27 | 3 | |
| Pen Uncapping | PIPER Pen Uncapping | Success Rate89.36 | 3 | |
| Shirt Folding | PIPER Shirt Folding | Success Rate76.1 | 3 |