SharpDepth: Sharpening Metric Depth Predictions Using Diffusion Distillation
About
We propose SharpDepth, a novel approach to monocular metric depth estimation that combines the metric accuracy of discriminative depth estimation methods (e.g., Metric3D, UniDepth) with the fine-grained boundary sharpness typically achieved by generative methods (e.g., Marigold, Lotus). Traditional discriminative models trained on real-world data with sparse ground-truth depth can accurately predict metric depth but often produce over-smoothed or low-detail depth maps. Generative models, in contrast, are trained on synthetic data with dense ground truth; they generate depth maps with sharp boundaries but provide only relative depth with low accuracy. Our approach addresses both limitations by integrating metric accuracy with detailed boundary preservation, resulting in depth predictions that are both metrically precise and visually sharp. Our extensive zero-shot evaluations on standard depth estimation benchmarks confirm SharpDepth's effectiveness, showing its ability to achieve both high depth accuracy and detailed representation, making it well-suited for applications requiring high-quality depth perception across diverse, real-world environments.
Related benchmarks
| Task | Dataset | Metric | Value | Rank |
|---|---|---|---|---|
| Monocular Depth Estimation | KITTI | Abs Rel | 0.06 | 161 |
| Monocular Depth Estimation | ETH3D | AbsRel | 47 | 117 |
| Monocular Depth Estimation | NYU V2 | Delta 1 Acc | 97 | 113 |
| Monocular Depth Estimation | DIODE | AbsRel | 29 | 93 |
| Depth Prediction | Sintel | AbsRel | 0.92 | 32 |
| Monocular Depth Estimation | Booster | δ1 | 28 | 26 |
| Visual SLAM | TUM RGB-D fr1 desk | -- | -- | 21 |
| Monocular Depth Estimation | nuScenes | A.Rel | 0.18 | 18 |
| Depth Estimation | iBims | Abs Rel Error | 39 | 14 |
| Depth Estimation | UnrealStereo4K | Eps DBE Acc | 1.37 | 8 |
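
Most rows above report either absolute relative error (AbsRel) or the δ1 threshold accuracy. As a rough illustration of how these two metrics are commonly computed, here is a minimal NumPy sketch. It uses the standard textbook definitions and is not the evaluation code behind these benchmark entries; the function names `abs_rel` and `delta1`, the validity mask, and the 1.25 threshold are assumptions made for this example.

```python
import numpy as np


def abs_rel(pred, gt):
    """Mean absolute relative error over pixels with valid ground truth.

    Assumption: a ground-truth value of 0 (or below) marks a missing pixel.
    """
    pred = np.asarray(pred, dtype=np.float64)
    gt = np.asarray(gt, dtype=np.float64)
    valid = gt > 0
    return float(np.mean(np.abs(pred[valid] - gt[valid]) / gt[valid]))


def delta1(pred, gt, threshold=1.25):
    """Fraction of valid pixels where max(pred/gt, gt/pred) < threshold."""
    pred = np.asarray(pred, dtype=np.float64)
    gt = np.asarray(gt, dtype=np.float64)
    # Require positive predictions too, so neither ratio divides by zero.
    valid = (gt > 0) & (pred > 0)
    ratio = np.maximum(pred[valid] / gt[valid], gt[valid] / pred[valid])
    return float(np.mean(ratio < threshold))


# Toy 2x2 depth maps in metres; the zero in gt is treated as missing.
gt = np.array([[2.0, 4.0], [0.0, 10.0]])
pred = np.array([[2.2, 3.0], [5.0, 10.0]])

print(f"AbsRel: {abs_rel(pred, gt):.4f}")
print(f"delta1: {delta1(pred, gt):.4f}")
```

Lower AbsRel is better, while higher δ1 is better. Both functions return fractions in their natural range; benchmarks often report them scaled to percentages.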