Sparse Diffusion Policy: A Sparse, Reusable, and Flexible Policy for Robot Learning

About

The increasing complexity of tasks in robotics demands efficient strategies for multitask and continual learning. Traditional models typically rely on a universal policy for all tasks, facing challenges such as high computational costs and catastrophic forgetting when learning new tasks. To address these issues, we introduce a sparse, reusable, and flexible policy, Sparse Diffusion Policy (SDP). By adopting Mixture of Experts (MoE) within a transformer-based diffusion policy, SDP selectively activates experts and skills, enabling efficient and task-specific learning without retraining the entire model. SDP not only reduces the burden of active parameters but also facilitates the seamless integration and reuse of experts across various tasks. Extensive experiments on diverse tasks in both simulations and real world show that SDP 1) excels in multitask scenarios with negligible increases in active parameters, 2) prevents forgetting in continual learning of new tasks, and 3) enables efficient task transfer, offering a promising solution for advanced robotic applications. Demos and codes can be found in https://forrest-110.github.io/sparse_diffusion_policy/.

Yixiao Wang, Yifei Zhang, Mingxiao Huo, Ran Tian, Xiang Zhang, Yichen Xie, Chenfeng Xu, Pengliang Ji, Wei Zhan, Mingyu Ding, Masayoshi Tomizuka• 2024

Related benchmarks

Task	Dataset	Result
Robotic Manipulation	RoboTwin 2.0	Average Success Rate44	100
Robotic Manipulation	RLBench (test)	Average Success Rate42.1	49
Bimanual Multi-Task Learning	RoboTwin and RLBench average over all tasks 2	Np154.4	7
Bimanual Multi-Task Learning	RLBench 2	Tray Success Rate15	6
Robotic Manipulation	MimicGen (test)	Square Success Rate74	6
Robotic Manipulation	MimicGen	Square74	6
Multi-task Robotic Manipulation	RoboTwin Simulation 2.0 (test)	Adjust bottle95	4
Robot Manipulation	MimicGen	Coffee Success Rate82	4
Multitask Robot Manipulation	MetaWorld (test)	Door Open80	4
Multi-Skill Policy Pre-Training	Real-world robot tasks	Task 1 Success Rate82	3

Showing 10 of 11 rows

Other info

Follow for update

@wizwand_team Discord