Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Enforcing geometric constraints of virtual normal for depth prediction

About

Monocular depth prediction plays a crucial role in understanding 3D scene geometry. Although recent methods have achieved impressive progress in evaluation metrics such as the pixel-wise relative error, most methods neglect the geometric constraints in the 3D space. In this work, we show the importance of the high-order 3D geometric constraints for depth prediction. By designing a loss term that enforces one simple type of geometric constraints, namely, virtual normal directions determined by randomly sampled three points in the reconstructed 3D space, we can considerably improve the depth prediction accuracy. Significantly, the byproduct of this predicted depth being sufficiently accurate is that we are now able to recover good 3D structures of the scene such as the point cloud and surface normal directly from the depth, eliminating the necessity of training new sub-models as was previously done. Experiments on two benchmarks: NYU Depth-V2 and KITTI demonstrate the effectiveness of our method and state-of-the-art performance.

Wei Yin, Yifan Liu, Chunhua Shen, Youliang Yan• 2019

Related benchmarks

TaskDatasetResultRank
Monocular Depth EstimationKITTI (Eigen)
Abs Rel0.072
502
Depth EstimationNYU v2 (test)
Threshold Accuracy (delta < 1.25)87.5
423
Depth EstimationKITTI (Eigen split)
RMSE3.172
276
Monocular Depth EstimationNYU v2 (test)
Abs Rel0.108
257
Monocular Depth EstimationKITTI (Eigen split)
Abs Rel0.072
193
Depth EstimationNYU Depth V2
RMSE0.416
177
Monocular Depth EstimationKITTI
Abs Rel0.072
161
Monocular Depth EstimationKITTI Raw Eigen (test)
RMSE3.258
159
Monocular Depth EstimationKITTI 80m maximum depth (Eigen)
Abs Rel0.072
126
Surface Normal PredictionNYU V2
Mean Error24.6
100
Showing 10 of 55 rows

Other info

Code

Follow for update