Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

DM-NeRF: 3D Scene Geometry Decomposition and Manipulation from 2D Images

About

In this paper, we study the problem of 3D scene geometry decomposition and manipulation from 2D views. By leveraging the recent implicit neural representation techniques, particularly the appealing neural radiance fields, we introduce an object field component to learn unique codes for all individual objects in 3D space only from 2D supervision. The key to this component is a series of carefully designed loss functions to enable every 3D point, especially in non-occupied space, to be effectively optimized even without 3D labels. In addition, we introduce an inverse query algorithm to freely manipulate any specified 3D object shape in the learned scene representation. Notably, our manipulation algorithm can explicitly tackle key issues such as object collisions and visual occlusions. Our method, called DM-NeRF, is among the first to simultaneously reconstruct, decompose, manipulate and render complex 3D scenes in a single pipeline. Extensive experiments on three datasets clearly show that our method can accurately decompose all 3D objects from 2D views, allowing any interested object to be freely manipulated in 3D space such as translation, rotation, size adjustment, and deformation.

Bing Wang, Lu Chen, Bo Yang• 2022

Related benchmarks

TaskDatasetResultRank
Semantic segmentationScanNet--
59
Novel View SynthesisScanNet
PSNR28.21
58
ReconstructionReplica instance segmentation setting
PSNR40.66
16
Semantic View Synthesis (Novel View)ScanNet V2 (val)
mIoU93.5
12
3D Instance SegmentationReplica3D
PQ Scene44.1
7
3D Instance SegmentationScanNet
PQscene41.7
7
Semantic segmentationReplica v1 (test)
mIoU56
6
Semantic segmentationHyperSim v1 (test)
mIoU57.6
6
Semantic segmentationScanNet v1 (test)
mIoU49.5
6
Novel View SynthesisHyperSim v1 (test)
PSNR28.1
5
Showing 10 of 19 rows

Other info

Follow for update