Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

HybridDepth: Robust Metric Depth Fusion by Leveraging Depth from Focus and Single-Image Priors

About

We propose HYBRIDDEPTH, a robust depth estimation pipeline that addresses key challenges in depth estimation,including scale ambiguity, hardware heterogeneity, and generalizability. HYBRIDDEPTH leverages focal stack, data conveniently accessible in common mobile devices, to produce accurate metric depth maps. By incorporating depth priors afforded by recent advances in singleimage depth estimation, our model achieves a higher level of structural detail compared to existing methods. We test our pipeline as an end-to-end system, with a newly developed mobile client to capture focal stacks, which are then sent to a GPU-powered server for depth estimation. Comprehensive quantitative and qualitative analyses demonstrate that HYBRIDDEPTH outperforms state-of-the-art(SOTA) models on common datasets such as DDFF12 and NYU Depth V2. HYBRIDDEPTH also shows strong zero-shot generalization. When trained on NYU Depth V2, HYBRIDDEPTH surpasses SOTA models in zero-shot performance on ARKitScenes and delivers more structurally accurate depth maps on Mobile Depth. The code is available at https://github.com/cake-lab/HybridDepth/.

Ashkan Ganj, Hang Su, Tian Guo• 2024

Related benchmarks

TaskDatasetResultRank
Depth EstimationNYU Depth V2
RMSE0.128
177
Depth EstimationARKitScenes (val)
RMSE0.29
7
Depth EstimationDDFF12
MSE5.10e-4
6
Depth EstimationNYU Depth distances under 2 meters V2
RMSE0.082
2
Showing 4 of 4 rows

Other info

Code

Follow for update