Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Single-view 3D Scene Reconstruction with High-fidelity Shape and Texture

About

Reconstructing detailed 3D scenes from single-view images remains a challenging task due to limitations in existing approaches, which primarily focus on geometric shape recovery, overlooking object appearances and fine shape details. To address these challenges, we propose a novel framework for simultaneous high-fidelity recovery of object shapes and textures from single-view images. Our approach utilizes the proposed Single-view neural implicit Shape and Radiance field (SSR) representations to leverage both explicit 3D shape supervision and volume rendering of color, depth, and surface normal images. To overcome shape-appearance ambiguity under partial observations, we introduce a two-stage learning curriculum incorporating both 3D and 2D supervisions. A distinctive feature of our framework is its ability to generate fine-grained textured meshes while seamlessly integrating rendering capabilities into the single-view 3D reconstruction model. This integration enables not only improved textured 3D object reconstruction by 27.7% and 11.6% on the 3D-FRONT and Pix3D datasets, respectively, but also supports the rendering of images from novel viewpoints. Beyond individual objects, our approach facilitates composing object-level representations into flexible scene representations, thereby enabling applications such as holistic scene understanding and 3D scene editing. We conduct extensive experiments to demonstrate the effectiveness of our method.

Yixin Chen, Junfeng Ni, Nan Jiang, Yaowei Zhang, Yixin Zhu, Siyuan Huang• 2023

Related benchmarks

TaskDatasetResultRank
3D Shape ReconstructionPix3D (test)
F-Score59.71
9
Scene GenerationMIDI (test)
CD-S14
9
Single-image 3D scene generation3D-Front synthetic (test)
CD (Shape)0.14
8
Single-image 3D scene generationBlendSwap synthetic (test)
CD-S0.132
8
Object Reconstruction (Chamfer Distance ↓)Pix3D (test)
Mean CD21.79
5
Object Reconstruction (Normal Consistency ↑)Pix3D (test)
Normal Consistency (NC)77.8
5
Single-view 3D Object ReconstructionPix3D 10% labeled data (S1)
Chair Accuracy16.1
5
Showing 7 of 7 rows

Other info

Follow for update