DreamPartGen: Semantically Grounded Part-Level 3D Generation via Collaborative Latent Denoising
About
Understanding and generating 3D objects as compositions of meaningful parts is fundamental to human perception and reasoning. However, most text-to-3D methods overlook the semantic and functional structure of parts. While recent part-aware approaches introduce decomposition, they remain largely geometry-focused, lacking semantic grounding and failing to model how parts align with textual descriptions or their inter-part relations. We propose DreamPartGen, a framework for semantically grounded, part-aware text-to-3D generation. DreamPartGen introduces Duplex Part Latents (DPLs) that jointly model each part's geometry and appearance, and Relational Semantic Latents (RSLs) that capture inter-part dependencies derived from language. A synchronized co-denoising process enforces mutual geometric and semantic consistency, enabling coherent, interpretable, and text-aligned 3D synthesis. Across multiple benchmarks, DreamPartGen delivers state-of-the-art performance in geometric fidelity and text-shape alignment.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| 3D Mesh Generation | Objaverse | Chamfer Distance0.141 | 18 | |
| 3D Object Generation | ShapeNet | Chamfer Distance (CD)0.222 | 10 | |
| Part-level 3D Generation | ShapeNet | Chamfer Distance (CD)0.088 | 7 | |
| 3D Scene Generation | 3D-FRONT Occluded 1.0 | Chamfer Distance0.2321 | 6 | |
| 3D Object Generation | ABO | CD0.101 | 5 | |
| 3D Object Generation | PartRel3D | Chamfer Distance (CD)0.081 | 5 | |
| Part-level 3D object generation | Objaverse | r-FID4.0579 | 5 | |
| Part-level 3D object generation | ShapeNet | r-FID4.9736 | 5 | |
| Part-level 3D object generation | ABO | r-FID4.5632 | 5 | |
| Part-level 3D object generation | PartRel3D | r-FID9.7836 | 5 |