Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Beyond the Prototype: Divide-and-conquer Proxies for Few-shot Segmentation

About

Few-shot segmentation, which aims to segment unseen-class objects given only a handful of densely labeled samples, has received widespread attention from the community. Existing approaches typically follow the prototype learning paradigm to perform meta-inference, which fails to fully exploit the underlying information from support image-mask pairs, resulting in various segmentation failures, e.g., incomplete objects, ambiguous boundaries, and distractor activation. To this end, we propose a simple yet versatile framework in the spirit of divide-and-conquer. Specifically, a novel self-reasoning scheme is first implemented on the annotated support image, and then the coarse segmentation mask is divided into multiple regions with different properties. Leveraging effective masked average pooling operations, a series of support-induced proxies are thus derived, each playing a specific role in conquering the above challenges. Moreover, we devise a unique parallel decoder structure that integrates proxies with similar attributes to boost the discrimination power. Our proposed approach, named divide-and-conquer proxies (DCP), allows for the development of appropriate and reliable information as a guide at the "episode" level, not just about the object cues themselves. Extensive experiments on PASCAL-5i and COCO-20i demonstrate the superiority of DCP over conventional prototype-based approaches (up to 5~10% on average), which also establishes a new state-of-the-art. Code is available at github.com/chunbolang/DCP.

Chunbo Lang, Binfei Tu, Gong Cheng, Junwei Han• 2022

Related benchmarks

TaskDatasetResultRank
Few-shot SegmentationPASCAL-5i
mIoU (Fold 0)67.19
325
Semantic segmentationCOCO-20i
mIoU (Mean)46.5
132
Semantic segmentationPASCAL-5i
Mean mIoU67.8
111
Few-shot Semantic SegmentationCOCO 5-shot 20i
mIoU46.48
85
Few-shot Semantic SegmentationCOCO 20i 1-shot
mIoU (Overall)41.39
77
Semantic segmentationPASCAL-5^i Fold-3
mIoU64.5
75
Semantic segmentationPASCAL-5^i Fold-2
mIoU66.4
75
Semantic segmentationPASCAL-5^i Fold-1
mIoU73.1
75
Semantic segmentationPASCAL-5^i Fold-0
mIoU67.2
75
Semantic segmentationCOCO-20i (test)
Mean Score46.5
70
Showing 10 of 11 rows

Other info

Code

Follow for update