Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Reactive Diffusion Policy: Slow-Fast Visual-Tactile Policy Learning for Contact-Rich Manipulation

About

Humans can accomplish complex contact-rich tasks using vision and touch, with highly reactive capabilities such as fast response to external changes and adaptive control of contact forces; however, this remains challenging for robots. Existing visual imitation learning (IL) approaches rely on action chunking to model complex behaviors, which lacks the ability to respond instantly to real-time tactile feedback during the chunk execution. Furthermore, most teleoperation systems struggle to provide fine-grained tactile / force feedback, which limits the range of tasks that can be performed. To address these challenges, we introduce TactAR, a low-cost teleoperation system that provides real-time tactile feedback through Augmented Reality (AR), along with Reactive Diffusion Policy (RDP), a novel slow-fast visual-tactile imitation learning algorithm for learning contact-rich manipulation skills. RDP employs a two-level hierarchy: (1) a slow latent diffusion policy for predicting high-level action chunks in latent space at low frequency, (2) a fast asymmetric tokenizer for closed-loop tactile feedback control at high frequency. This design enables both complex trajectory modeling and quick reactive behavior within a unified framework. Through extensive evaluation across three challenging contact-rich tasks, RDP significantly improves performance compared to state-of-the-art visual IL baselines. Furthermore, experiments show that RDP is applicable across different tactile / force sensors. Code and videos are available on https://reactive-diffusion-policy.github.io.

Han Xue, Jieji Ren, Wendi Chen, Gu Zhang, Yuan Fang, Guoying Gu, Huazhe Xu, Cewu Lu• 2025

Related benchmarks

TaskDatasetResultRank
Robot ManipulationAdroit
Success Rate70
18
Vase WipingVase Wiping 30 Demos Flexiv Rizon4 Single-arm 1.0 (test)
Task Score47.5
13
Chip HandoverChip Handover 50 Demos Bi-Arx5 Dual-arm 1.0 (test)
Success Rate20
13
Lock OpeningLock Opening 20 Demos Flexiv Rizon4 Single-arm 1.0 (test)
Success Rate10
13
Multi-task Performance AggregationCombined Five Tasks (Shoe Lacing, Chip Handover, Cucum. Peeling, Vase Wiping, Lock Opening) 1.0 (average)
Average Performance30.3
13
Cucumber PeelingCucumber Peeling 50 Demos, Bi-Arx5 Dual-arm 1.0 (test)
Task Score74
13
Shoe LacingShoe Lacing 100 Demos, Bi-Arx5 Dual-arm 1.0 (test)
Success Rate0.00e+0
13
Robotic ManipulationAdroit and Meta-World Average (simulation)
Success Rate73
9
Robotic ManipulationMeta-World simulation
Success Rate76
6
Robotic ManipulationReal-world robot experiments
Pick & Place Success Rate73.3
4
Showing 10 of 10 rows

Other info

Follow for update