Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Learning Single-Image Depth from Videos using Quality Assessment Networks

About

Depth estimation from a single image in the wild remains a challenging problem. One main obstacle is the lack of high-quality training data for images in the wild. In this paper we propose a method to automatically generate such data through Structure-from-Motion (SfM) on Internet videos. The core of this method is a Quality Assessment Network that identifies high-quality reconstructions obtained from SfM. Using this method, we collect single-view depth training data from a large number of YouTube videos and construct a new dataset called YouTube3D. Experiments show that YouTube3D is useful in training depth estimation networks and advances the state of the art of single-view depth estimation in the wild.

Weifeng Chen, Shengyi Qian, Jia Deng• 2018

Related benchmarks

TaskDatasetResultRank
Monocular Depth EstimationKITTI
Abs Rel0.327
161
Depth EstimationKITTI
AbsRel37.9
92
Monocular Depth EstimationScanNet
AbsRel16.5
64
Depth EstimationDIODE
Delta-1 Accuracy66
62
Depth PredictionETH3D
AbsRel23.7
35
Depth PredictionSintel--
32
2D Depth EstimationScanNet
AbsRel23.7
26
Monocular Depth EstimationNYU
AbsRel16.6
21
Depth PredictionNYU
Delta-1 Accuracy77.3
16
Depth PredictionYT3D
AbsRel20.9
9
Showing 10 of 11 rows

Other info

Follow for update