Reactive Diffusion Policy: Slow-Fast Visual-Tactile Policy Learning for Contact-Rich Manipulation

About

Humans can accomplish complex contact-rich tasks using vision and touch, with highly reactive capabilities such as fast response to external changes and adaptive control of contact forces; however, this remains challenging for robots. Existing visual imitation learning (IL) approaches rely on action chunking to model complex behaviors, which lacks the ability to respond instantly to real-time tactile feedback during the chunk execution. Furthermore, most teleoperation systems struggle to provide fine-grained tactile / force feedback, which limits the range of tasks that can be performed. To address these challenges, we introduce TactAR, a low-cost teleoperation system that provides real-time tactile feedback through Augmented Reality (AR), along with Reactive Diffusion Policy (RDP), a novel slow-fast visual-tactile imitation learning algorithm for learning contact-rich manipulation skills. RDP employs a two-level hierarchy: (1) a slow latent diffusion policy for predicting high-level action chunks in latent space at low frequency, (2) a fast asymmetric tokenizer for closed-loop tactile feedback control at high frequency. This design enables both complex trajectory modeling and quick reactive behavior within a unified framework. Through extensive evaluation across three challenging contact-rich tasks, RDP significantly improves performance compared to state-of-the-art visual IL baselines. Furthermore, experiments show that RDP is applicable across different tactile / force sensors. Code and videos are available on https://reactive-diffusion-policy.github.io.

Han Xue, Jieji Ren, Wendi Chen, Gu Zhang, Yuan Fang, Guoying Gu, Huazhe Xu, Cewu Lu• 2025

Related benchmarks

Task	Dataset	Result
Robotic Manipulation	Real-robot manipulation tasks Aggregate	Average Success Rate (Avg SR)87	19
Robot Manipulation	Adroit	Success Rate70	18
Robotic Manipulation	Wipe Vase	Success Rate85	14
Vase Wiping	Vase Wiping 30 Demos Flexiv Rizon4 Single-arm 1.0 (test)	Task Score47.5	13
Chip Handover	Chip Handover 50 Demos Bi-Arx5 Dual-arm 1.0 (test)	Success Rate20	13
Lock Opening	Lock Opening 20 Demos Flexiv Rizon4 Single-arm 1.0 (test)	Success Rate10	13
Multi-task Performance Aggregation	Combined Five Tasks (Shoe Lacing, Chip Handover, Cucum. Peeling, Vase Wiping, Lock Opening) 1.0 (average)	Average Performance30.3	13
Cucumber Peeling	Cucumber Peeling 50 Demos, Bi-Arx5 Dual-arm 1.0 (test)	Task Score74	13
Shoe Lacing	Shoe Lacing 100 Demos, Bi-Arx5 Dual-arm 1.0 (test)	Success Rate0.00e+0	13
Charger Plugging	Charger Plugging	Success Rate (SR)100	11

Showing 10 of 59 rows

Other info

Follow for update

@wizwand_team Discord