Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Learning from Future: A Novel Self-Training Framework for Semantic Segmentation

About

Self-training has shown great potential in semi-supervised learning. Its core idea is to use the model learned on labeled data to generate pseudo-labels for unlabeled samples, and in turn teach itself. To obtain valid supervision, active attempts typically employ a momentum teacher for pseudo-label prediction yet observe the confirmation bias issue, where the incorrect predictions may provide wrong supervision signals and get accumulated in the training process. The primary cause of such a drawback is that the prevailing self-training framework acts as guiding the current state with previous knowledge, because the teacher is updated with the past student only. To alleviate this problem, we propose a novel self-training strategy, which allows the model to learn from the future. Concretely, at each training step, we first virtually optimize the student (i.e., caching the gradients without applying them to the model weights), then update the teacher with the virtual future student, and finally ask the teacher to produce pseudo-labels for the current student as the guidance. In this way, we manage to improve the quality of pseudo-labels and thus boost the performance. We also develop two variants of our future-self-training (FST) framework through peeping at the future both deeply (FST-D) and widely (FST-W). Taking the tasks of unsupervised domain adaptive semantic segmentation and semi-supervised semantic segmentation as the instances, we experimentally demonstrate the effectiveness and superiority of our approach under a wide range of settings. Code will be made publicly available.

Ye Du, Yujun Shen, Haochen Wang, Jingjing Fei, Wei Li, Liwei Wu, Rui Zhao, Zehua Fu, Qingjie Liu• 2022

Related benchmarks

TaskDatasetResultRank
Semantic segmentationPASCAL VOC 2012 (val)
Mean IoU78.1
2040
Semantic segmentationGTA5 → Cityscapes (val)
mIoU69.3
533
Semantic segmentationSYNTHIA to Cityscapes
Road IoU88.3
150
Semantic segmentationPASCAL VOC 2012 (val)
mIoU78.1
126
Semantic segmentationGTA5 to Cityscapes
mIoU69.3
58
Semantic segmentationSYNTHIA-to-Cityscapes (SYN2CS) 16 classes (val)--
50
Semantic segmentationPascal Blended 1/4 augmented (train)
mIoU78.1
32
Semantic segmentationPascal Blended augmented (1/8 train)
mIoU76.1
32
Semantic segmentationPascal Blended 662 labels augmented (1/16 train)
mIoU73.9
31
Unsupervised Domain Adaptation (Semantic Segmentation)SYNTHIA to Cityscapes (val)
Road88.3
6
Showing 10 of 10 rows

Other info

Code

Follow for update