Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

$\mathrm{D}^\mathrm{3}$-Predictor: Noise-Free Deterministic Diffusion for Dense Prediction

About

Although diffusion models with strong visual priors have emerged as powerful dense prediction backbones, they overlook a core limitation: the stochastic noise at the core of diffusion sampling is inherently misaligned with dense prediction that requires a deterministic mapping from image to geometry. In this paper, we show that this stochastic noise corrupts fine-grained spatial cues and pushes the model toward timestep-specific noise objectives, consequently destroying meaningful geometric structure mappings. To address this, we introduce $\mathrm{D}^\mathrm{3}$-Predictor, a noise-free deterministic diffusion-based dense prediction model built by reformulating a pretrained diffusion model without stochasticity noise. Instead of relying on noisy inputs to leverage diffusion priors, $\mathrm{D}^\mathrm{3}$-Predictor views the pretrained diffusion network as an ensemble of timestep-dependent visual experts and self-supervisedly aggregates their heterogeneous priors into a single, clean, and complete geometric prior. Meanwhile, we utilize task-specific supervision to seamlessly adapt this noise-free prior to dense prediction tasks. Extensive experiments on various dense prediction tasks demonstrate that $\mathrm{D}^\mathrm{3}$-Predictor achieves competitive or state-of-the-art performance in diverse scenarios. In addition, it requires less than half the training data previously used and efficiently performs inference in a single step. Our code, data, and checkpoints are publicly available at https://x-gengroup.github.io/HomePage_D3-Predictor/.

Changliang Xia, Chengyou Jia, Minnan Luo, Zhuohang Dang, Xin Shen, Bowen Ping• 2025

Related benchmarks

TaskDatasetResultRank
Depth EstimationScanNet
AbsRel0.063
94
Depth EstimationKITTI
AbsRel0.082
92
Depth EstimationDIODE
Delta-1 Accuracy77.9
62
Depth EstimationNYU
AbsRel0.052
20
Depth EstimationETH3D
AbsRel0.062
19
Auto. Tree TaggingAuto. Tree Tagging (test)
mIoU0.857
4
Cigarette DetectionCigarette Detect
mIoU87.5
4
Close-Range SegmentationClose-Range Segment
IoU65.5
4
Food Cell InspectionFood Cell Inspect (test)
IoU97.6
4
Mask-Wear MonitoringMask-Wear Monitor (test)
IoU90.3
4
Showing 10 of 25 rows

Other info

Follow for update