Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Iris: Bringing Real-World Priors into Diffusion Model for Monocular Depth Estimation

About

In this paper, we propose \textbf{Iris}, a deterministic framework for Monocular Depth Estimation (MDE) that integrates real-world priors into the diffusion model. Conventional feed-forward methods rely on massive training data, yet still miss details. Previous diffusion-based methods leverage rich generative priors yet struggle with synthetic-to-real domain transfer. Iris, in contrast, preserves fine details, generalizes strongly from synthetic to real scenes, and remains efficient with limited training data. To this end, we introduce a two-stage Priors-to-Geometry Deterministic (PGD) schedule: the prior stage uses Spectral-Gated Distillation (SGD) to transfer low-frequency real priors while leaving high-frequency details unconstrained, and the geometry stage applies Spectral-Gated Consistency (SGC) to enforce high-frequency fidelity while refining with synthetic ground truth. The two stages share weights and are executed with a high-to-low timestep schedule. Extensive experimental results confirm that Iris achieves significant improvements in MDE performance with strong in-the-wild generalization.

Xinhao Cai, Gensheng Pei, Zeren Sun, Yazhou Yao, Fumin Shen, Wenguan Wang• 2026

Related benchmarks

TaskDatasetResultRank
Affine-invariant depth estimationETH3D
AbsRel5.5
59
Affine-invariant depth estimationNYU V2
AbsRel4.9
59
Affine-invariant depth estimationScanNet
AbsRel5
58
Affine-invariant depth estimationKITTI Outdoor
AbsRel7.2
46
Affine-invariant depth estimationDIODE Full
AbsRel24.3
29
Affine-invariant depth estimationDIODE Various
AbsRel24.3
27
Affine-invariant depth estimationKITTI, NYUv2, ETH3D, ScanNet, DIODE, DA-2K Aggregated
Group Avg Rank1.5
16
Affine-invariant depth estimationConsolidated (KITTI, NYUv2, ETH3D, ScanNet, DIODE)
Average Ranking (All)3.1
16
Affine-invariant depth estimationDA-2K
Accuracy94.5
16
Showing 9 of 9 rows

Other info

Follow for update