Need for Speed: Zero-Shot Depth Completion with Single-Step Diffusion
About
We introduce Marigold-SSD, a single-step, late-fusion depth completion framework that leverages strong diffusion priors while eliminating the costly test-time optimization typically associated with diffusion-based methods. By shifting computational burden from inference to finetuning, our approach enables efficient and robust 3D perception under real-world latency constraints. Marigold-SSD achieves significantly faster inference with a training cost of only 4.5 GPU days. We evaluate our method across four indoor and two outdoor benchmarks, demonstrating strong cross-domain generalization and zero-shot performance compared to existing depth completion approaches. Our approach significantly narrows the efficiency gap between diffusion-based and discriminative models. Finally, we challenge common evaluation protocols by analyzing performance under varying input sparsity levels. Page: https://dtu-pas.github.io/marigold-ssd/
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Depth Completion | KITTI | RMSE1.496 | 37 | |
| Depth Completion | NYU V2 | RMSE0.128 | 32 | |
| Depth Completion | iBIMS-1 | MAE0.056 | 27 | |
| Depth Completion | Overall Average (ScanNet, IBims-1, VOID, NYUv2, KITTI, DDAD) | Rank3.75 | 17 | |
| Depth Completion | VOID | MAE0.177 | 17 | |
| Depth Completion | DDAD | MAE2.066 | 16 | |
| Depth Completion | ScanNet | -- | 16 | |
| Depth Completion | KITTI (val) | -- | 6 |