Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

MFNet: Multi-class Few-shot Segmentation Network with Pixel-wise Metric Learning

About

In visual recognition tasks, few-shot learning requires the ability to learn object categories with few support examples. Its re-popularity in light of the deep learning development is mainly in image classification. This work focuses on few-shot semantic segmentation, which is still a largely unexplored field. A few recent advances are often restricted to single-class few-shot segmentation. In this paper, we first present a novel multi-way (class) encoding and decoding architecture which effectively fuses multi-scale query information and multi-class support information into one query-support embedding. Multi-class segmentation is directly decoded upon this embedding. For better feature fusion, a multi-level attention mechanism is proposed within the architecture, which includes the attention for support feature modulation and attention for multi-scale combination. Last, to enhance the embedding space learning, an additional pixel-wise metric learning module is introduced with triplet loss formulated on the pixel-level embedding of the input image. Extensive experiments on standard benchmarks PASCAL-5i and COCO-20i show clear benefits of our method over the state of the art in few-shot segmentation

Miao Zhang, Miaojing Shi, Li Li• 2021

Related benchmarks

TaskDatasetResultRank
Few-shot Semantic SegmentationCOCO-20i
mIoU41.9
115
Few-shot Semantic SegmentationPASCAL-5i
mIoU59.7
96
Few-shot Semantic SegmentationCOCO-20i (test)
mIoU (mean)24.1
79
Few-shot SegmentationPascal-5^i 1-way 1-shot
mIoU64.5
71
Few-shot Semantic SegmentationCOCO 20^i (test)--
14
2-way Few-Shot Semantic SegmentationPASCAL-5i (fold-0)
mIoU (Variant)58.5
10
2-way Few-Shot Semantic SegmentationPASCAL-5i (fold-1)
mIoU (Variant)70
10
2-way Few-Shot Semantic SegmentationPASCAL-5i (fold-3)
mIoU (asterisk)54.6
10
2-way Few-Shot Semantic SegmentationPASCAL-5i (Mean)
mIoU (Variant)59.7
10
2-way Few-Shot Semantic SegmentationPASCAL-5i (fold-2)
mIoU57
10
Showing 10 of 12 rows

Other info

Code

Follow for update