High Quality Monocular Depth Estimation via Transfer Learning

About

Accurate depth estimation from images is a fundamental task in many applications including scene understanding and reconstruction. Existing solutions for depth estimation often produce blurry approximations of low resolution. This paper presents a convolutional neural network for computing a high-resolution depth map given a single RGB image with the help of transfer learning. Following a standard encoder-decoder architecture, we leverage features extracted using high performing pre-trained networks when initializing our encoder along with augmentation and training strategies that lead to more accurate results. We show how, even for a very simple decoder, our method is able to achieve detailed high-resolution depth maps. Our network, with fewer parameters and training iterations, outperforms state-of-the-art on two datasets and also produces qualitatively better results that capture object boundaries more faithfully. Code and corresponding pre-trained weights are made publicly available.

Ibraheem Alhashim, Peter Wonka• 2018

Related benchmarks

Task	Dataset	Result
Depth Estimation	NYU v2 (test)	Threshold Accuracy (delta < 1.25)84.6	435
Depth Estimation	NYU Depth V2	RMSE0.39	209
Monocular Depth Estimation	KITTI (test)	--	114
Depth Prediction	NYU Depth V2 (test)	Accuracy (δ < 1.25)89.5	113
Monocular Depth Estimation	NYU Depth Eigen v2 (test)	A.Rel0.093	49
Depth Estimation	DIODE Indoor	Relative Error (REL)0.6599	24
Depth Completion	ClearGrasp (test)	MAE (m)0.26	14
Monocular Depth Estimation	NYU Depth V2 (test)	--	5
Monocular Depth Estimation	Unreal-1k	Delta 154.4	3

Showing 9 of 9 rows

Other info

Code

Follow for update

@wizwand_team Discord