Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Auto-Rectify Network for Unsupervised Indoor Depth Estimation

About

Single-View depth estimation using the CNNs trained from unlabelled videos has shown significant promise. However, excellent results have mostly been obtained in street-scene driving scenarios, and such methods often fail in other settings, particularly indoor videos taken by handheld devices. In this work, we establish that the complex ego-motions exhibited in handheld settings are a critical obstacle for learning depth. Our fundamental analysis suggests that the rotation behaves as noise during training, as opposed to the translation (baseline) which provides supervision signals. To address the challenge, we propose a data pre-processing method that rectifies training images by removing their relative rotations for effective learning. The significantly improved performance validates our motivation. Towards end-to-end learning without requiring pre-processing, we propose an Auto-Rectify Network with novel loss functions, which can automatically learn to rectify images during training. Consequently, our results outperform the previous unsupervised SOTA method by a large margin on the challenging NYUv2 dataset. We also demonstrate the generalization of our trained model in ScanNet and Make3D, and the universality of our proposed learning method on 7-Scenes and KITTI datasets.

Jia-Wang Bian, Huangying Zhan, Naiyan Wang, Tat-Jun Chin, Chunhua Shen, Ian Reid• 2020

Related benchmarks

TaskDatasetResultRank
Monocular Depth EstimationKITTI (Eigen)
Abs Rel0.118
502
Depth EstimationNYU v2 (test)
Threshold Accuracy (delta < 1.25)80.4
423
Monocular Depth EstimationNYU v2 (test)
Abs Rel0.138
257
Monocular Depth EstimationMake3D (test)
Abs Rel0.305
132
Monocular Depth EstimationNYU V2--
113
Single-view depth estimationNYUv2 36 (test)
AbsRel0.138
21
Single-view depth estimationNYU official 654 images v2 (test)
AbsRel0.138
21
Single-view depth estimationKITTI 33
AbsRel0.118
16
Video Depth EstimationNYUDV2 (Eigen split)
OPW Score0.474
15
Video Depth EstimationKITTI (Eigen split)
Delta1 Acc86.6
9
Showing 10 of 30 rows

Other info

Follow for update