Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

MVDepthNet: Real-time Multiview Depth Estimation Neural Network

About

Although deep neural networks have been widely applied to computer vision problems, extending them into multiview depth estimation is non-trivial. In this paper, we present MVDepthNet, a convolutional network to solve the depth estimation problem given several image-pose pairs from a localized monocular camera in neighbor viewpoints. Multiview observations are encoded in a cost volume and then combined with the reference image to estimate the depth map using an encoder-decoder network. By encoding the information from multiview observations into the cost volume, our method achieves real-time performance and the flexibility of traditional methods that can be applied regardless of the camera intrinsic parameters and the number of images. Geometric data augmentation is used to train MVDepthNet. We further apply MVDepthNet in a monocular dense mapping system that continuously estimates depth maps using a single localized moving camera. Experiments show that our method can generate depth maps efficiently and precisely.

Kaixuan Wang, Shaojie Shen• 2018

Related benchmarks

TaskDatasetResultRank
Depth EstimationScanNet (test)
Abs Rel0.1116
65
3D Geometry ReconstructionScanNet
Accuracy24
54
3D Scene ReconstructionScanNet v2 (test)
Accuracy0.205
26
2D Depth EstimationScanNet
AbsRel0.098
26
Depth EstimationSun3D (test)
Abs Rel13.77
22
2D Depth Estimation7 Scenes
Abs Rel0.1925
20
Depth Estimation7-Scenes (test)
Abs Rel0.1905
19
Multi-view Depth EstimationScanNet 16 (test)
Abs Rel Error0.1116
12
3D Geometry ReconstructionScanNet (Atlas split)
Completeness0.04
11
Depth EstimationScanNet v2 (test)
Abs Diff0.1648
10
Showing 10 of 14 rows

Other info

Follow for update