Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Sparse Diffusion Policy: A Sparse, Reusable, and Flexible Policy for Robot Learning

About

The increasing complexity of tasks in robotics demands efficient strategies for multitask and continual learning. Traditional models typically rely on a universal policy for all tasks, facing challenges such as high computational costs and catastrophic forgetting when learning new tasks. To address these issues, we introduce a sparse, reusable, and flexible policy, Sparse Diffusion Policy (SDP). By adopting Mixture of Experts (MoE) within a transformer-based diffusion policy, SDP selectively activates experts and skills, enabling efficient and task-specific learning without retraining the entire model. SDP not only reduces the burden of active parameters but also facilitates the seamless integration and reuse of experts across various tasks. Extensive experiments on diverse tasks in both simulations and real world show that SDP 1) excels in multitask scenarios with negligible increases in active parameters, 2) prevents forgetting in continual learning of new tasks, and 3) enables efficient task transfer, offering a promising solution for advanced robotic applications. Demos and codes can be found in https://forrest-110.github.io/sparse_diffusion_policy/.

Yixiao Wang, Yifei Zhang, Mingxiao Huo, Ran Tian, Xiang Zhang, Yichen Xie, Chenfeng Xu, Pengliang Ji, Wei Zhan, Mingyu Ding, Masayoshi Tomizuka• 2024

Related benchmarks

TaskDatasetResultRank
Robotic ManipulationRLBench (test)
Average Success Rate42.1
34
Robotic ManipulationRoboTwin 2.0
Pick Diverse Bottles Success Rate37
17
Bimanual Multi-Task LearningRoboTwin and RLBench average over all tasks 2
Np154.4
7
Bimanual Multi-Task LearningRLBench 2
Tray Success Rate15
6
Robotic ManipulationMimicGen (test)
Square Success Rate74
6
Robotic ManipulationMimicGen
Square74
6
Multi-task Robotic ManipulationRoboTwin Simulation 2.0 (test)
Adjust bottle95
4
Multitask Robot ManipulationMetaWorld (test)
Door Open80
4
Multi-Skill Policy Pre-TrainingReal-world robot tasks
Task 1 Success Rate82
3
Continual AdaptationLIBERO
L1 PnP Soup Success Rate57.2
3
Showing 10 of 10 rows

Other info

Follow for update