Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

AllSpark: Reborn Labeled Features from Unlabeled in Transformer for Semi-Supervised Semantic Segmentation

About

Semi-supervised semantic segmentation (SSSS) has been proposed to alleviate the burden of time-consuming pixel-level manual labeling, which leverages limited labeled data along with larger amounts of unlabeled data. Current state-of-the-art methods train the labeled data with ground truths and unlabeled data with pseudo labels. However, the two training flows are separate, which allows labeled data to dominate the training process, resulting in low-quality pseudo labels and, consequently, sub-optimal results. To alleviate this issue, we present AllSpark, which reborns the labeled features from unlabeled ones with the channel-wise cross-attention mechanism. We further introduce a Semantic Memory along with a Channel Semantic Grouping strategy to ensure that unlabeled features adequately represent labeled features. The AllSpark shed new light on the architecture level designs of SSSS rather than framework level, which avoids increasingly complicated training pipeline designs. It can also be regarded as a flexible bottleneck module that can be seamlessly integrated into a general transformer-based segmentation model. The proposed AllSpark outperforms existing methods across all evaluation protocols on Pascal, Cityscapes and COCO benchmarks without bells-and-whistles. Code and model weights are available at: https://github.com/xmed-lab/AllSpark.

Haonan Wang, Qixiang Zhang, Yi Li, Xiaomeng Li• 2024

Related benchmarks

TaskDatasetResultRank
Semantic segmentationPascal VOC (Original set)
mIoU82.1
105
Semantic segmentationPotsdam (test)
mIoU74.76
104
Semantic segmentationCOCO
mIoU50.9
96
Semantic segmentationPascal VOC blended 2012 (train)--
96
Semantic segmentationPASCAL VOC Augmented 2012
mIoU82.04
85
Semantic segmentationCityscapes 1/4 (744 labels)
mIoU80.6
80
Semantic segmentationCityscapes 1/16 (186 labeled samples)
mIoU78.33
68
Semantic segmentationCITYSCAPES 1/8 labeled samples 372 labels (val)
mIoU79.2
65
Semantic segmentationLoveDA
IoU (Background)67.13
60
Semantic segmentationPascal VOC 1/16 labeled 2012 (train)
mIoU76.1
53
Showing 10 of 40 rows

Other info

Code

Follow for update