Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

SimpleRecon: 3D Reconstruction Without 3D Convolutions

About

Traditionally, 3D indoor scene reconstruction from posed images happens in two phases: per-image depth estimation, followed by depth merging and surface reconstruction. Recently, a family of methods have emerged that perform reconstruction directly in final 3D volumetric feature space. While these methods have shown impressive reconstruction results, they rely on expensive 3D convolutional layers, limiting their application in resource-constrained environments. In this work, we instead go back to the traditional route, and show how focusing on high quality multi-view depth prediction leads to highly accurate 3D reconstructions using simple off-the-shelf depth fusion. We propose a simple state-of-the-art multi-view depth estimator with two main contributions: 1) a carefully-designed 2D CNN which utilizes strong image priors alongside a plane-sweep feature volume and geometric losses, combined with 2) the integration of keyframe and geometric metadata into the cost volume which allows informed depth plane scoring. Our method achieves a significant lead over the current state-of-the-art for depth estimation and close or better for 3D reconstruction on ScanNet and 7-Scenes, yet still allows for online real-time low-memory reconstruction. Code, models and results are available at https://nianticlabs.github.io/simplerecon

Mohamed Sayed, John Gibson, Jamie Watson, Victor Prisacariu, Michael Firman, Cl\'ement Godard• 2022

Related benchmarks

TaskDatasetResultRank
3D Scene Reconstruction7-Scenes (test)
Accuracy8
27
3D Scene ReconstructionScanNet v2 (test)
Accuracy0.065
26
3D ReconstructionScanNet
F-score68.3
13
Surface ReconstructionASE (val)
Accuracy53.9
10
Surface ReconstructionADT dataset
Accuracy32.6
10
3D surface reconstructionScanNet V2 (testplanes)
Chamfer Distance5.4
7
3D Scene ReconstructionNYU 9 unseen scenes v2 (test)
C-l10.051
7
3D Plane ReconstructionScanNet V2 (testplanes)
Fidelity9.42
7
3D Plane SegmentationScanNet V2 (testplanes)
VOI2.507
7
3D Mesh Reconstruction Quality EvaluationScanNet (test)
Aspe Mean0.436
6
Showing 10 of 13 rows

Other info

Follow for update