Generalizable Humanoid Manipulation with 3D Diffusion Policies

About

Humanoid robots capable of autonomous operation in diverse environments have long been a goal for roboticists. However, autonomous manipulation by humanoid robots has largely been restricted to one specific scene, primarily due to the difficulty of acquiring generalizable skills and the expensiveness of in-the-wild humanoid robot data. In this work, we build a real-world robotic system to address this challenging problem. Our system is mainly an integration of 1) a whole-upper-body robotic teleoperation system to acquire human-like robot data, 2) a 25-DoF humanoid robot platform with a height-adjustable cart and a 3D LiDAR sensor, and 3) an improved 3D Diffusion Policy learning algorithm for humanoid robots to learn from noisy human data. We run more than 2000 episodes of policy rollouts on the real robot for rigorous policy evaluation. Empowered by this system, we show that using only data collected in one single scene and with only onboard computing, a full-sized humanoid robot can autonomously perform skills in diverse real-world scenarios. Videos are available at https://humanoid-manipulation.github.io .

Yanjie Ze, Zixuan Chen, Wenhao Wang, Tianyi Chen, Xialin He, Ying Yuan, Xue Bin Peng, Jiajun Wu• 2024

Related benchmarks

Task	Dataset	Result
Insertion	Simulation	Insertion Success Rate47	19
Robot Manipulation	Adroit	Success Rate62.7	18
Insertion	Real-world	Success Rate75	16
Pick-&-Place	Real-world	Success Rate55	15
Lift	Simulation	Success Rate65	14
Robotic Manipulation	Adroit and Meta-World Average (simulation)	Success Rate67	9
Move Card	Real-world	Success Rate80	7
Move Card	Simulation v1 (100 trials)	Task Success Rate86	7
Grasp Handle	Real-world	Success Rate65	7
Grasp Handle	Simulation v1 (100 trials)	Task Success Rate73	7

Showing 10 of 26 rows

Other info

Follow for update

@wizwand_team Discord