Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Diffusion-based Generation, Optimization, and Planning in 3D Scenes

About

We introduce SceneDiffuser, a conditional generative model for 3D scene understanding. SceneDiffuser provides a unified model for solving scene-conditioned generation, optimization, and planning. In contrast to prior works, SceneDiffuser is intrinsically scene-aware, physics-based, and goal-oriented. With an iterative sampling strategy, SceneDiffuser jointly formulates the scene-aware generation, physics-based optimization, and goal-oriented planning via a diffusion-based denoising process in a fully differentiable fashion. Such a design alleviates the discrepancies among different modules and the posterior collapse of previous scene-conditioned generative models. We evaluate SceneDiffuser with various 3D scene understanding tasks, including human pose and motion generation, dexterous grasp generation, path planning for 3D navigation, and motion planning for robot arms. The results show significant improvements compared with previous models, demonstrating the tremendous potential of SceneDiffuser for the broad community of 3D scene understanding.

Siyuan Huang, Zan Wang, Puhao Li, Baoxiong Jia, Tengyu Liu, Yixin Zhu, Wei Liang, Song-Chun Zhu• 2023

Related benchmarks

TaskDatasetResultRank
Dexterous Grasp GenerationDexGraspNet 1.0 (test)
Penetration Error0.107
12
Dexterous Grasp GenerationDexGYSNet
P-FID7.93
12
Dexterous GraspingMultiDex official (test)
Success Rate (6)69.8
12
Dexterous GraspingUniDexGrasp official (test)
Success Rate (6 DOF)28.3
12
Dexterous GraspingDexGRAB official (test)
Success Rate (Suc.6)39.1
12
Dexterous GraspingRealDex official (test)
Success Rate (Suc.6)0.217
12
Motion GenerationTRUMANS (test)
Diversity (Div)2.03
9
Task-Oriented GraspingOakInk-PartialPC (test)
Penetration Volume (cm3)6.52
6
Task-Oriented GraspingNovel Object Categories
Penetration Volume (cm3)12.43
6
Dexterous GraspingDexGraspNet (test)
Success Rate (6 DoF)26.6
6
Showing 10 of 13 rows

Other info

Follow for update