Reactive Diffusion Policy: Slow-Fast Visual-Tactile Policy Learning for Contact-Rich Manipulation
About
Humans can accomplish complex contact-rich tasks using vision and touch, with highly reactive capabilities such as fast response to external changes and adaptive control of contact forces; however, this remains challenging for robots. Existing visual imitation learning (IL) approaches rely on action chunking to model complex behaviors, which lacks the ability to respond instantly to real-time tactile feedback during the chunk execution. Furthermore, most teleoperation systems struggle to provide fine-grained tactile / force feedback, which limits the range of tasks that can be performed. To address these challenges, we introduce TactAR, a low-cost teleoperation system that provides real-time tactile feedback through Augmented Reality (AR), along with Reactive Diffusion Policy (RDP), a novel slow-fast visual-tactile imitation learning algorithm for learning contact-rich manipulation skills. RDP employs a two-level hierarchy: (1) a slow latent diffusion policy for predicting high-level action chunks in latent space at low frequency, (2) a fast asymmetric tokenizer for closed-loop tactile feedback control at high frequency. This design enables both complex trajectory modeling and quick reactive behavior within a unified framework. Through extensive evaluation across three challenging contact-rich tasks, RDP significantly improves performance compared to state-of-the-art visual IL baselines. Furthermore, experiments show that RDP is applicable across different tactile / force sensors. Code and videos are available on https://reactive-diffusion-policy.github.io.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Robot Manipulation | Adroit | Success Rate70 | 18 | |
| Vase Wiping | Vase Wiping 30 Demos Flexiv Rizon4 Single-arm 1.0 (test) | Task Score47.5 | 13 | |
| Chip Handover | Chip Handover 50 Demos Bi-Arx5 Dual-arm 1.0 (test) | Success Rate20 | 13 | |
| Lock Opening | Lock Opening 20 Demos Flexiv Rizon4 Single-arm 1.0 (test) | Success Rate10 | 13 | |
| Multi-task Performance Aggregation | Combined Five Tasks (Shoe Lacing, Chip Handover, Cucum. Peeling, Vase Wiping, Lock Opening) 1.0 (average) | Average Performance30.3 | 13 | |
| Cucumber Peeling | Cucumber Peeling 50 Demos, Bi-Arx5 Dual-arm 1.0 (test) | Task Score74 | 13 | |
| Shoe Lacing | Shoe Lacing 100 Demos, Bi-Arx5 Dual-arm 1.0 (test) | Success Rate0.00e+0 | 13 | |
| Robotic Manipulation | Adroit and Meta-World Average (simulation) | Success Rate73 | 9 | |
| Robotic Manipulation | Meta-World simulation | Success Rate76 | 6 | |
| Robotic Manipulation | Real-world robot experiments | Pick & Place Success Rate73.3 | 4 |