High-Precision Dichotomous Image Segmentation via Depth Integrity-Prior and Fine-Grained Patch Strategy

About

High-precision dichotomous image segmentation (DIS) is a task of extracting fine-grained objects from high-resolution images. Existing methods trade efficiency for accuracy: non-diffusion methods are fast but suffer from weak semantics and unstable spatial priors, causing false detections; diffusion-based methods offer high accuracy via strong generative priors but are computationally expensive. In depth maps, a complete object appears as a low variance region with a smooth interior and sharp boundaries, whereas the background exhibits a chaotic, high variance pattern due to disconnected surfaces at varying depths. We refer to this as the depth integrity-prior. Inspired by this, and noting that DIS currently lacks depth maps, we leverage pseudo-depth information from monocular depth estimation models to obtain essential semantic understanding, thereby rapidly revealing spatial differences across target objects and the background. To exploit this prior, we propose the Prior-guided Depth Fusion Network (PDFNet), which fuses RGB and pseudo-depth features for depth-aware structure perception. We further introduce a novel depth integrity-prior loss to enforce depth consistency in segmentation and a fine-grained enhancement module with adaptive patch selection to sharpen boundaries. Notably, PDFNet with DAM-v2 achieves SOTA (Fmax 0.915 on DIS-VD and 0.915 on DIS-TE) using less than half the params of diffusion-based methods. Our code is available at https://tennine2077.github.io/PDFNet.github.io/ .

Xianjie Liu, Keren Fu, Qijun Zhao• 2025

Related benchmarks

Task	Dataset	Result
Dichotomous Image Segmentation	DIS5K DIS-TE1 (test)	Fmax88.8	24
Dichotomous Image Segmentation	DIS5K DIS-TE2 (test)	Fmax91.9	24
Dichotomous Image Segmentation	DIS5K DIS-TE3 (test)	Fmax0.929	24
Dichotomous Image Segmentation	DIS5K DIS-TE Overall (test)	Fmax Score0.913	24
Dichotomous Image Segmentation	DIS5K DIS-TE4 (test)	Fmax0.91	24
Salient Object Detection	HRSOD 400 (test)	Fw-beta Score0.943	15
Dichotomous Image Segmentation	DIS 470 (val)	Fmax0.915	14
Dichotomous Image Segmentation	DIS TE1 500 (test)	Fmax89.1	14
Dichotomous Image Segmentation	DIS-TE2 500 (test)	Fmax92	14
Dichotomous Image Segmentation	DIS-TE3 500 (test)	Fmax93.6	14

Showing 10 of 15 rows

Other info

Code

Follow for update

@wizwand_team Discord