Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

AdaBins: Depth Estimation using Adaptive Bins

About

We address the problem of estimating a high quality dense depth map from a single RGB input image. We start out with a baseline encoder-decoder convolutional neural network architecture and pose the question of how the global processing of information can help improve overall depth estimation. To this end, we propose a transformer-based architecture block that divides the depth range into bins whose center value is estimated adaptively per image. The final depth values are estimated as linear combinations of the bin centers. We call our new building block AdaBins. Our results show a decisive improvement over the state-of-the-art on several popular depth datasets across all metrics. We also validate the effectiveness of the proposed block with an ablation study and provide the code and corresponding pre-trained weights of the new state-of-the-art model.

Shariq Farooq Bhat, Ibraheem Alhashim, Peter Wonka• 2020

Related benchmarks

TaskDatasetResultRank
Monocular Depth EstimationKITTI (Eigen)
Abs Rel0.058
523
Depth EstimationNYU v2 (test)
Threshold Accuracy (delta < 1.25)90.3
432
Monocular Depth EstimationNYU v2 (test)
Abs Rel0.103
300
Depth EstimationKITTI (Eigen split)
RMSE2.36
291
Monocular Depth EstimationKITTI (Eigen split)
Abs Rel0.058
215
Depth EstimationNYU Depth V2
RMSE0.364
209
Monocular Depth EstimationKITTI
Abs Rel0.058
203
Monocular Depth EstimationKITTI Raw Eigen (test)
RMSE2.36
159
Monocular Depth EstimationNYU V2
Delta 1 Acc90.3
131
Monocular Depth EstimationKITTI 80m maximum depth (Eigen)
Abs Rel0.058
126
Showing 10 of 88 rows
...

Other info

Code

Follow for update