Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Semi-Supervised Deep Learning for Monocular Depth Map Prediction

About

Supervised deep learning often suffers from the lack of sufficient training data. Specifically in the context of monocular depth map prediction, it is barely possible to determine dense ground truth depth images in realistic dynamic outdoor environments. When using LiDAR sensors, for instance, noise is present in the distance measurements, the calibration between sensors cannot be perfect, and the measurements are typically much sparser than the camera images. In this paper, we propose a novel approach to depth map prediction from monocular images that learns in a semi-supervised way. While we use sparse ground-truth depth for supervised learning, we also enforce our deep network to produce photoconsistent dense depth maps in a stereo setup using a direct image alignment loss. In experiments we demonstrate superior performance in depth map prediction from single images compared to the state-of-the-art methods.

Yevhen Kuznietsov, J\"org St\"uckler, Bastian Leibe• 2017

Related benchmarks

TaskDatasetResultRank
Monocular Depth EstimationKITTI (Eigen)
Abs Rel0.113
523
Depth EstimationKITTI (Eigen split)
RMSE3.518
291
Monocular Depth EstimationKITTI
Abs Rel0.113
220
Monocular Depth EstimationKITTI Raw Eigen (test)
RMSE3.518
159
Depth EstimationKITTI
RMSE4.621
156
Monocular Depth EstimationMake3D (test)
Abs Rel0.421
132
Monocular Depth EstimationKITTI 80m maximum depth (Eigen)
Abs Rel0.113
126
Monocular Depth EstimationKITTI (test)--
114
Monocular Depth EstimationKITTI Improved GT (Eigen)
AbsRel0.113
111
Monocular Depth EstimationKITTI Eigen split (test)
AbsRel Mean0.113
100
Showing 10 of 24 rows

Other info

Follow for update