Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

UniDepthV2: Universal Monocular Metric Depth Estimation Made Simpler

About

Accurate monocular metric depth estimation (MMDE) is crucial to solving downstream tasks in 3D perception and modeling. However, the remarkable accuracy of recent MMDE methods is confined to their training domains. These methods fail to generalize to unseen domains even in the presence of moderate domain gaps, which hinders their practical applicability. We propose a new model, UniDepthV2, capable of reconstructing metric 3D scenes from solely single images across domains. Departing from the existing MMDE paradigm, UniDepthV2 directly predicts metric 3D points from the input image at inference time without any additional information, striving for a universal and flexible MMDE solution. In particular, UniDepthV2 implements a self-promptable camera module predicting a dense camera representation to condition depth features. Our model exploits a pseudo-spherical output representation, which disentangles the camera and depth representations. In addition, we propose a geometric invariance loss that promotes the invariance of camera-prompted depth features. UniDepthV2 improves its predecessor UniDepth model via a new edge-guided loss which enhances the localization and sharpness of edges in the metric depth outputs, a revisited, simplified and more efficient architectural design, and an additional uncertainty-level output which enables downstream tasks requiring confidence. Thorough evaluations on ten depth datasets in a zero-shot regime consistently demonstrate the superior performance and generalization of UniDepthV2. Code and models are available at https://github.com/lpiccinelli-eth/UniDepth

Luigi Piccinelli, Christos Sakaridis, Yung-Hsu Yang, Mattia Segu, Siyuan Li, Wim Abbeloos, Luc Van Gool• 2025

Related benchmarks

TaskDatasetResultRank
Depth EstimationNYU Depth V2
RMSE0.18
177
Monocular Depth EstimationKITTI
Abs Rel5.98
161
Monocular Depth EstimationETH3D
AbsRel15
117
Depth EstimationScanNet
AbsRel2.7
94
Monocular Depth EstimationDIODE
AbsRel41
93
Depth EstimationKITTI
AbsRel0.037
92
Depth EstimationScanNet (test)--
65
Monocular Depth EstimationiBIMS-1
ARel0.0771
32
Monocular Depth EstimationSUN-RGBD zero-shot
Delta Accuracy (< 1.25)96.4
29
Depth EstimationHAMMER
Delta 10.645
29
Showing 10 of 73 rows
...

Other info

Code

Follow for update