Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Zero-Shot Metric Depth with a Field-of-View Conditioned Diffusion Model

About

While methods for monocular depth estimation have made significant strides on standard benchmarks, zero-shot metric depth estimation remains unsolved. Challenges include the joint modeling of indoor and outdoor scenes, which often exhibit significantly different distributions of RGB and depth, and the depth-scale ambiguity due to unknown camera intrinsics. Recent work has proposed specialized multi-head architectures for jointly modeling indoor and outdoor scenes. In contrast, we advocate a generic, task-agnostic diffusion model, with several advancements such as log-scale depth parameterization to enable joint modeling of indoor and outdoor scenes, conditioning on the field-of-view (FOV) to handle scale ambiguity and synthetically augmenting FOV during training to generalize beyond the limited camera intrinsics in training datasets. Furthermore, by employing a more diverse training mixture than is common, and an efficient diffusion parameterization, our method, DMD (Diffusion for Metric Depth) achieves a 25\% reduction in relative error (REL) on zero-shot indoor and 33\% reduction on zero-shot outdoor datasets over the current SOTA using only a small number of denoising steps. For an overview see https://diffusion-vision.github.io/dmd

Saurabh Saxena, Junhwa Hur, Charles Herrmann, Deqing Sun, David J. Fleet• 2023

Related benchmarks

TaskDatasetResultRank
Monocular Depth EstimationNYU v2 (test)
Abs Rel0.072
257
Monocular Depth EstimationDDAD (test)
RMSE5.365
122
Depth EstimationSUN RGB-D (test)
Root Mean Square Error (RMS)0.275
93
Monocular Depth EstimationKITTI Eigen (test)
AbsRel0.053
46
Depth EstimationiBims 1 (test)
REL0.118
41
Monocular Depth EstimationDiode Indoor (test)
A.Rel0.291
25
Monocular Depth EstimationKITTI official (val)
RMSE2.411
23
Monocular Depth EstimationSUN-RGBD (test)
AbsRel0.109
22
Monocular Depth EstimationVirtual KITTI 2 (test)
Delta 1 Acc89
22
Monocular Depth EstimationDIODE Outdoor (test)
RMSE8.943
18
Showing 10 of 16 rows

Other info

Code

Follow for update