IEBins: Iterative Elastic Bins for Monocular Depth Estimation
About
Monocular depth estimation (MDE) is a fundamental topic of geometric computer vision and a core technique for many downstream applications. Recently, several methods reframe the MDE as a classification-regression problem where a linear combination of probabilistic distribution and bin centers is used to predict depth. In this paper, we propose a novel concept of iterative elastic bins (IEBins) for the classification-regression-based MDE. The proposed IEBins aims to search for high-quality depth by progressively optimizing the search range, which involves multiple stages and each stage performs a finer-grained depth search in the target bin on top of its previous stage. To alleviate the possible error accumulation during the iterative process, we utilize a novel elastic target bin to replace the original target bin, the width of which is adjusted elastically based on the depth uncertainty. Furthermore, we develop a dedicated framework composed of a feature extractor and an iterative optimizer that has powerful temporal context modeling capabilities benefiting from the GRU-based architecture. Extensive experiments on the KITTI, NYU-Depth-v2 and SUN RGB-D datasets demonstrate that the proposed method surpasses prior state-of-the-art competitors. The source code is publicly available at https://github.com/ShuweiShao/IEBins.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Depth Estimation | NYU v2 (test) | Threshold Accuracy (delta < 1.25)93.6 | 423 | |
| Monocular Depth Estimation | NYU v2 (test) | Abs Rel0.087 | 257 | |
| Monocular Depth Estimation | KITTI (Eigen split) | Abs Rel0.05 | 193 | |
| Depth Estimation | NYU Depth V2 | RMSE0.314 | 177 | |
| Monocular Depth Estimation | KITTI (test) | Abs Rel Error0.05 | 103 | |
| Depth Estimation | KITTI | AbsRel0.05 | 92 | |
| Monocular Depth Estimation | NYU-Depth v2 (official) | Abs Rel0.087 | 75 | |
| Monocular Depth Estimation | KITTI Eigen (test) | AbsRel0.05 | 46 | |
| Metric Depth Estimation | KITTI in-domain (test) | Acc (δ < 1.25)97.8 | 27 | |
| Monocular Depth Estimation | KITTI official (val) | RMSE2.37 | 23 |