Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Multi-view Aggregation Network for Dichotomous Image Segmentation

About

Dichotomous Image Segmentation (DIS) has recently emerged towards high-precision object segmentation from high-resolution natural images. When designing an effective DIS model, the main challenge is how to balance the semantic dispersion of high-resolution targets in the small receptive field and the loss of high-precision details in the large receptive field. Existing methods rely on tedious multiple encoder-decoder streams and stages to gradually complete the global localization and local refinement. Human visual system captures regions of interest by observing them from multiple views. Inspired by it, we model DIS as a multi-view object perception problem and provide a parsimonious multi-view aggregation network (MVANet), which unifies the feature fusion of the distant view and close-up view into a single stream with one encoder-decoder structure. With the help of the proposed multi-view complementary localization and refinement modules, our approach established long-range, profound visual interactions across multiple views, allowing the features of the detailed close-up view to focus on highly slender structures.Experiments on the popular DIS-5K dataset show that our MVANet significantly outperforms state-of-the-art methods in both accuracy and speed. The source code and datasets will be publicly available at \href{https://github.com/qianyu-dlut/MVANet}{MVANet}.

Qian Yu, Xiaoqi Zhao, Youwei Pang, Lihe Zhang, Huchuan Lu• 2024

Related benchmarks

TaskDatasetResultRank
Dichotomous Image SegmentationDIS5K (DIS-VD)
S_alpha0.909
30
Dichotomous Image SegmentationDIS5K TE (1-4) (test)
Fw_beta85.5
25
Dichotomous Image SegmentationDIS TE4 500 (test)
Fmax91.2
14
Dichotomous Image SegmentationDIS 470 (val)
Fmax0.904
14
Dichotomous Image SegmentationDIS TE1 500 (test)
Fmax88.1
14
Dichotomous Image SegmentationDIS-TE2 500 (test)
Fmax91.6
14
Dichotomous Image SegmentationDIS-TE3 500 (test)
Fmax92.9
14
Dichotomous Image SegmentationDIS ALL 2,000 (test)
Fmax90.8
14
Dichotomous Image SegmentationDIS5K DIS-TE1 (test)
Fmax89.3
12
Dichotomous Image SegmentationDIS5K DIS-TE2 (test)
Fmax92.5
12
Showing 10 of 13 rows

Other info

Follow for update