Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

NeuralRecon: Real-Time Coherent 3D Reconstruction from Monocular Video

About

We present a novel framework named NeuralRecon for real-time 3D scene reconstruction from a monocular video. Unlike previous methods that estimate single-view depth maps separately on each key-frame and fuse them later, we propose to directly reconstruct local surfaces represented as sparse TSDF volumes for each video fragment sequentially by a neural network. A learning-based TSDF fusion module based on gated recurrent units is used to guide the network to fuse features from previous fragments. This design allows the network to capture local smoothness prior and global shape prior of 3D surfaces when sequentially reconstructing the surfaces, resulting in accurate, coherent, and real-time surface reconstruction. The experiments on ScanNet and 7-Scenes datasets show that our system outperforms state-of-the-art methods in terms of both accuracy and speed. To the best of our knowledge, this is the first learning-based system that is able to reconstruct dense coherent 3D geometry in real-time.

Jiaming Sun, Yiming Xie, Linghao Chen, Xiaowei Zhou, Hujun Bao• 2021

Related benchmarks

TaskDatasetResultRank
3D Semantic SegmentationScanNet (test)
mIoU35.1
105
3D Semantic SegmentationScanNet (val)
mIoU37.5
100
3D Geometry ReconstructionScanNet
Accuracy5.8
54
3D Reconstruction7 Scenes--
32
3D Scene Reconstruction7-Scenes (test)
Accuracy6.1
27
2D Depth EstimationScanNet
AbsRel0.063
26
3D Scene ReconstructionScanNet v2 (test)
Accuracy0.054
26
2D Depth Estimation7 Scenes
Abs Rel0.155
20
Depth EstimationTUM-RGBD
Abs Rel Error0.106
16
3D ReconstructionScanNet
F-score61.7
13
Showing 10 of 29 rows

Other info

Code

Follow for update