Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

High-Precision Dichotomous Image Segmentation via Depth Integrity-Prior and Fine-Grained Patch Strategy

About

High-precision dichotomous image segmentation (DIS) is a task of extracting fine-grained objects from high-resolution images. Existing methods trade efficiency for accuracy: non-diffusion methods are fast but suffer from weak semantics and unstable spatial priors, causing false detections; diffusion-based methods offer high accuracy via strong generative priors but are computationally expensive. In depth maps, a complete object appears as a low variance region with a smooth interior and sharp boundaries, whereas the background exhibits a chaotic, high variance pattern due to disconnected surfaces at varying depths. We refer to this as the depth integrity-prior. Inspired by this, and noting that DIS currently lacks depth maps, we leverage pseudo-depth information from monocular depth estimation models to obtain essential semantic understanding, thereby rapidly revealing spatial differences across target objects and the background. To exploit this prior, we propose the Prior-guided Depth Fusion Network (PDFNet), which fuses RGB and pseudo-depth features for depth-aware structure perception. We further introduce a novel depth integrity-prior loss to enforce depth consistency in segmentation and a fine-grained enhancement module with adaptive patch selection to sharpen boundaries. Notably, PDFNet with DAM-v2 achieves SOTA (Fmax 0.915 on DIS-VD and 0.915 on DIS-TE) using less than half the params of diffusion-based methods. Our code is available at https://tennine2077.github.io/PDFNet.github.io/ .

Xianjie Liu, Keren Fu, Qijun Zhao• 2025

Related benchmarks

TaskDatasetResultRank
Dichotomous Image SegmentationDIS5K DIS-TE1 (test)
Fmax88.8
24
Dichotomous Image SegmentationDIS5K DIS-TE2 (test)
Fmax91.9
24
Dichotomous Image SegmentationDIS5K DIS-TE3 (test)
Fmax0.929
24
Dichotomous Image SegmentationDIS5K DIS-TE Overall (test)
Fmax Score0.913
24
Dichotomous Image SegmentationDIS5K DIS-TE4 (test)
Fmax0.91
24
Salient Object DetectionHRSOD 400 (test)
Fw-beta Score0.943
15
Dichotomous Image SegmentationDIS 470 (val)
Fmax0.915
14
Dichotomous Image SegmentationDIS TE1 500 (test)
Fmax89.1
14
Dichotomous Image SegmentationDIS-TE2 500 (test)
Fmax92
14
Dichotomous Image SegmentationDIS-TE3 500 (test)
Fmax93.6
14
Showing 10 of 15 rows

Other info

Code

Follow for update