Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

P$^{2}$Net: Patch-match and Plane-regularization for Unsupervised Indoor Depth Estimation

About

This paper tackles the unsupervised depth estimation task in indoor environments. The task is extremely challenging because of the vast areas of non-texture regions in these scenes. These areas could overwhelm the optimization process in the commonly used unsupervised depth estimation framework proposed for outdoor environments. However, even when those regions are masked out, the performance is still unsatisfactory. In this paper, we argue that the poor performance suffers from the non-discriminative point-based matching. To this end, we propose P$^2$Net. We first extract points with large local gradients and adopt patches centered at each point as its representation. Multiview consistency loss is then defined over patches. This operation significantly improves the robustness of the network training. Furthermore, because those textureless regions in indoor scenes (e.g., wall, floor, roof, \etc) usually correspond to planar regions, we propose to leverage superpixels as a plane prior. We enforce the predicted depth to be well fitted by a plane within each superpixel. Extensive experiments on NYUv2 and ScanNet show that our P$^2$Net outperforms existing approaches by a large margin. Code is available at \url{https://github.com/svip-lab/Indoor-SfMLearner}.

Zehao Yu, Lei Jin, Shenghua Gao• 2020

Related benchmarks

TaskDatasetResultRank
Depth EstimationNYU v2 (test)
Threshold Accuracy (delta < 1.25)82.3
423
Surface Normal EstimationNYU v2 (test)
Mean Angle Distance (MAD)36.1
206
Depth EstimationKITTI
AbsRel0.126
92
Camera pose estimationScanNet--
61
Depth EstimationScanNet NYUv2-pretrained (test)
RMSE0.42
3
Showing 5 of 5 rows

Other info

Code

Follow for update