EquiBot: SIM(3)-Equivariant Diffusion Policy for Generalizable and Data Efficient Learning

About

Building effective imitation learning methods that enable robots to learn from limited data and still generalize across diverse real-world environments is a long-standing problem in robot learning. We propose Equibot, a robust, data-efficient, and generalizable approach for robot manipulation task learning. Our approach combines SIM(3)-equivariant neural network architectures with diffusion models. This ensures that our learned policies are invariant to changes in scale, rotation, and translation, enhancing their applicability to unseen environments while retaining the benefits of diffusion-based policy learning such as multi-modality and robustness. We show on a suite of 6 simulation tasks that our proposed method reduces the data requirements and improves generalization to novel scenarios. In the real world, with 10 variations of 6 mobile manipulation tasks, we show that our method can easily generalize to novel objects and scenes after learning from just 5 minutes of human demonstrations in each task.

Jingyun Yang, Zi-ang Cao, Congyue Deng, Rika Antonova, Shuran Song, Jeannette Bohg• 2024

Related benchmarks

Task	Dataset	Result
Robot Manipulation	MimicGen Square D2	Success Rate0.00e+0	51
Robot Manipulation	MimicGen Stack Three D1	Success Rate0.00e+0	46
Coffee Making/Handling	Robomimic MimicGen Coffee (D2)	Success Rate0.00e+0	25
Mug Cleanup	Robomimic MimicGen Mug Cleanup (D1)	Success Rate24	20
Coffee Preparation	Robomimic/MimicGen Coffee Prep. (D1)	Success Rate2	20
Robot Manipulation	MimicGen Nut Assembly D0	Success Rate3	15
Robot Manipulation	MimicGen Hammer Cleanup D1	Success Rate14	10
Robot Manipulation	MimicGen Stack D1	Success Rate0.00e+0	10
Robot Manipulation Policy Inference	MimicGen	Coffee Success Rate (D2)2.06	8
Robotic Manipulation	Coffee D2 MimicGen (10° perturbation)	Success Rate0.00e+0	5

Showing 10 of 17 rows

Other info

Follow for update

@wizwand_team Discord