Self-supervised Sparse-to-Dense: Self-supervised Depth Completion from LiDAR and Monocular Camera
About
Depth completion, the technique of estimating a dense depth image from sparse depth measurements, has a variety of applications in robotics and autonomous driving. However, depth completion faces 3 main challenges: the irregularly spaced pattern in the sparse depth input, the difficulty in handling multiple sensor modalities (when color images are available), as well as the lack of dense, pixel-level ground truth depth labels. In this work, we address all these challenges. Specifically, we develop a deep regression model to learn a direct mapping from sparse depth (and color images) to dense depth. We also propose a self-supervised training framework that requires only sequences of color and sparse depth images, without the need for dense depth labels. Our experiments demonstrate that our network, when trained with semi-dense annotations, attains state-of-the- art accuracy and is the winning approach on the KITTI depth completion benchmark at the time of submission. Furthermore, the self-supervised framework outperforms a number of existing solutions trained with semi- dense annotations.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Depth Completion | NYU-depth-v2 official (test) | RMSE0.352 | 187 | |
| Depth Completion | KITTI depth completion official (test) | RMSE (mm)814.7 | 154 | |
| Depth Prediction | NYU Depth V2 (test) | Accuracy (δ < 1.25)97.8 | 113 | |
| Depth Completion | KITTI (test) | RMSE1.30e+3 | 67 | |
| Depth Completion | KITTI online leaderboard (test) | MAE0.25 | 48 | |
| Depth Completion | KITTI depth completion (val) | RMSE (mm)814.7 | 34 | |
| Depth Completion | KITTI-Depth | MAE249.9 | 27 | |
| Depth Completion | VOID (test) | MAE178.8 | 18 | |
| Depth Completion | KITTI supervised official | MAE249.9 | 12 | |
| Depth Completion | KITTI Depth Completion supervised track (online benchmark) | MAE (m)0.25 | 10 |