Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

UniDepth: Universal Monocular Metric Depth Estimation

About

Accurate monocular metric depth estimation (MMDE) is crucial to solving downstream tasks in 3D perception and modeling. However, the remarkable accuracy of recent MMDE methods is confined to their training domains. These methods fail to generalize to unseen domains even in the presence of moderate domain gaps, which hinders their practical applicability. We propose a new model, UniDepth, capable of reconstructing metric 3D scenes from solely single images across domains. Departing from the existing MMDE methods, UniDepth directly predicts metric 3D points from the input image at inference time without any additional information, striving for a universal and flexible MMDE solution. In particular, UniDepth implements a self-promptable camera module predicting dense camera representation to condition depth features. Our model exploits a pseudo-spherical output representation, which disentangles camera and depth representations. In addition, we propose a geometric invariance loss that promotes the invariance of camera-prompted depth features. Thorough evaluations on ten datasets in a zero-shot regime consistently demonstrate the superior performance of UniDepth, even when compared with methods directly trained on the testing domains. Code and models are available at: https://github.com/lpiccinelli-eth/unidepth

Luigi Piccinelli, Yung-Hsu Yang, Christos Sakaridis, Mattia Segu, Siyuan Li, Luc Van Gool, Fisher Yu• 2024

Related benchmarks

TaskDatasetResultRank
Monocular Depth EstimationNYU v2 (test)
Abs Rel0.063
257
Novel View SynthesisTanks&Temples (test)--
239
Monocular Depth EstimationKITTI
Abs Rel0.05
161
Monocular Depth EstimationDDAD (test)
RMSE5.399
122
Monocular Depth EstimationETH3D
AbsRel0.457
117
Monocular Depth EstimationNYU V2
Delta 1 Acc98
113
Video Depth EstimationSintel
Relative Error (Rel)0.473
109
Monocular Depth EstimationKITTI (test)
Abs Rel Error0.047
103
Video Depth EstimationBONN
Relative Error (Rel)0.057
103
Monocular Depth EstimationKITTI Eigen split (test)
AbsRel Mean4.21
94
Showing 10 of 77 rows
...

Other info

Code

Follow for update