Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Unsupervised Monocular Depth Learning in Dynamic Scenes

About

We present a method for jointly training the estimation of depth, ego-motion, and a dense 3D translation field of objects relative to the scene, with monocular photometric consistency being the sole source of supervision. We show that this apparently heavily underdetermined problem can be regularized by imposing the following prior knowledge about 3D translation fields: they are sparse, since most of the scene is static, and they tend to be constant for rigid moving objects. We show that this regularization alone is sufficient to train monocular depth prediction models that exceed the accuracy achieved in prior work for dynamic scenes, including methods that require semantic input. Code is at https://github.com/google-research/google-research/tree/master/depth_and_motion_learning .

Hanhan Li, Ariel Gordon, Hang Zhao, Vincent Casser, Anelia Angelova• 2020

Related benchmarks

TaskDatasetResultRank
Monocular Depth EstimationKITTI (Eigen)
Abs Rel0.13
502
Depth EstimationKITTI (Eigen split)
RMSE5.138
276
Monocular Depth EstimationKITTI
Abs Rel0.13
161
Monocular Depth EstimationKITTI Raw Eigen (test)
RMSE5.138
159
Monocular Depth EstimationKITTI Improved GT (Eigen)
AbsRel0.13
92
Monocular Depth EstimationCityscapes
Accuracy (delta < 1.25)84.6
62
Depth PredictionCityscapes (test)
RMSE6.98
52
Depth EstimationCityscapes (test)--
40
Depth PredictionKITTI original ground truth (test)
Abs Rel0.13
38
Depth PredictionKITTI original (Eigen split)
Abs Rel0.13
29
Showing 10 of 13 rows

Other info

Follow for update