Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

The Midas Touch for Metric Depth

About

Recent advances have markedly improved the cross-scene generalization of relative depth estimation, yet its practical applicability remains limited by the absence of metric scale, local inconsistencies, and low computational efficiency. To address these issues, we present \emph{\textbf{M}idas \textbf{T}ouch for \textbf{D}epth} (MTD), a mathematically interpretable approach that converts relative depth into metric depth using only extremely sparse 3D data. To eliminate local scale inconsistencies, it applies a segment-wise recovery strategy via sparse graph optimization, followed by a pixel-wise refinement strategy using a discontinuity-aware geodesic cost. MTD exhibits strong generalization and achieves substantial accuracy improvements over previous depth completion and depth estimation methods. Moreover, its lightweight, plug-and-play design facilitates deployment and integration on diverse downstream 3D tasks. Project page is available at https://mias.group/MTD.

Yu Ma, Zizhan Guo, Zuyi Xiong, Haoran Zhang, Yi Feng, Hongbo Zhao, Hanli Wang, Rui Fan• 2026

Related benchmarks

TaskDatasetResultRank
Depth EstimationNYU Depth V2--
209
Monocular Depth EstimationNYU V2
Delta 1 Acc99.5
174
Monocular Depth EstimationETH3D
AbsRel1.7
159
Depth EstimationKITTI--
156
Monocular Depth EstimationDIODE
AbsRel9.3
147
Depth EstimationScanNet
AbsRel0.042
121
Monocular Depth EstimationScanNet
AbsRel1.3
103
Depth EstimationDIODE
Delta-1 Accuracy81.6
82
Monocular Depth EstimationKITTI
AbsRel2.2
69
Stereo MatchingETH3D--
57
Showing 10 of 32 rows

Other info

Follow for update