G2-MonoDepth: A General Framework of Generalized Depth Inference from Monocular RGB+X Data
About
Monocular depth inference is a fundamental problem for scene perception of robots. Specific robots may be equipped with a camera plus an optional depth sensor of any type and located in various scenes of different scales, whereas recent advances derived multiple individual sub-tasks. It leads to additional burdens to fine-tune models for specific robots and thereby high-cost customization in large-scale industrialization. This paper investigates a unified task of monocular depth inference, which infers high-quality depth maps from all kinds of input raw data from various robots in unseen scenes. A basic benchmark G2-MonoDepth is developed for this task, which comprises four components: (a) a unified data representation RGB+X to accommodate RGB plus raw depth with diverse scene scale/semantics, depth sparsity ([0%, 100%]) and errors (holes/noises/blurs), (b) a novel unified loss to adapt to diverse depth sparsity/errors of input raw data and diverse scales of output scenes, (c) an improved network to well propagate diverse scene scales from input to output, and (d) a data augmentation pipeline to simulate all types of real artifacts in raw depth maps for training. G2-MonoDepth is applied in three sub-tasks including depth estimation, depth completion with different sparsity, and depth enhancement in unseen scenes, and it always outperforms SOTA baselines on both real-world data and synthetic data.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Depth Completion | iBIMS-1 | MAE0.078 | 43 | |
| Depth Completion | nuScenes | MAE4.587 | 24 | |
| Depth Completion | VOID-1500 | RMSE0.568 | 13 | |
| Depth Completion | VOID-500 | RMSE0.574 | 13 | |
| Depth Completion | VOID-150 | RMSE0.691 | 13 | |
| Depth Completion | Glacier flight | MAE5.099 | 10 | |
| Depth Completion | Farm field | MAE4.656 | 10 | |
| Depth Completion | Industrial hall | MAE4.725 | 10 | |
| Depth Completion | Ballast tank | MAE3.327 | 10 | |
| Depth Completion | Underwater fjord | MAE4.747 | 10 |