Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

DPSNet: End-to-end Deep Plane Sweep Stereo

About

Multiview stereo aims to reconstruct scene depth from images acquired by a camera under arbitrary motion. Recent methods address this problem through deep learning, which can utilize semantic cues to deal with challenges such as textureless and reflective regions. In this paper, we present a convolutional neural network called DPSNet (Deep Plane Sweep Network) whose design is inspired by best practices of traditional geometry-based approaches for dense depth reconstruction. Rather than directly estimating depth and/or optical flow correspondence from image pairs as done in many previous deep learning methods, DPSNet takes a plane sweep approach that involves building a cost volume from deep features using the plane sweep algorithm, regularizing the cost volume via a context-aware cost aggregation, and regressing the dense depth map from the cost volume. The cost volume is constructed using a differentiable warping process that allows for end-to-end training of the network. Through the effective incorporation of conventional multiview stereo concepts within a deep learning framework, DPSNet achieves state-of-the-art reconstruction results on a variety of challenging datasets.

Sunghoon Im, Hae-Gon Jeon, Stephen Lin, In So Kweon• 2019

Related benchmarks

TaskDatasetResultRank
Depth EstimationScanNet (test)
Abs Rel0.0986
65
3D Geometry ReconstructionScanNet
Accuracy28.4
54
3D Scene ReconstructionScanNet v2 (test)
Accuracy0.177
26
2D Depth EstimationScanNet
AbsRel0.087
26
Multi-view Depth EstimationScanNet (test)
Abs Rel0.094
23
Depth EstimationSun3D (test)
Abs Rel12.74
22
2D Depth Estimation7 Scenes
Abs Rel0.1991
20
Depth Estimation7-Scenes (test)
Abs Rel0.1675
19
Depth EstimationSUN3D
Abs Rel0.147
13
Depth EstimationScenes11 (test)
L1 Relative Error0.05
12
Showing 10 of 40 rows

Other info

Follow for update