Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Conflict-Based Cross-View Consistency for Semi-Supervised Semantic Segmentation

About

Semi-supervised semantic segmentation (SSS) has recently gained increasing research interest as it can reduce the requirement for large-scale fully-annotated training data. The current methods often suffer from the confirmation bias from the pseudo-labelling process, which can be alleviated by the co-training framework. The current co-training-based SSS methods rely on hand-crafted perturbations to prevent the different sub-nets from collapsing into each other, but these artificial perturbations cannot lead to the optimal solution. In this work, we propose a new conflict-based cross-view consistency (CCVC) method based on a two-branch co-training framework which aims at enforcing the two sub-nets to learn informative features from irrelevant views. In particular, we first propose a new cross-view consistency (CVC) strategy that encourages the two sub-nets to learn distinct features from the same input by introducing a feature discrepancy loss, while these distinct features are expected to generate consistent prediction scores of the input. The CVC strategy helps to prevent the two sub-nets from stepping into the collapse. In addition, we further propose a conflict-based pseudo-labelling (CPL) method to guarantee the model will learn more useful information from conflicting predictions, which will lead to a stable training process. We validate our new CCVC approach on the SSS benchmark datasets where our method achieves new state-of-the-art performance. Our code is available at https://github.com/xiaoyao3302/CCVC.

Zicheng Wang, Zhen Zhao, Xiaoxia Xing, Dong Xu, Xiangyu Kong, Luping Zhou• 2023

Related benchmarks

TaskDatasetResultRank
Semantic segmentationPASCAL VOC 2012 (val)
mIoU79.6
166
Semantic segmentationPASCAL VOC classic 2012 (val)--
143
Semantic segmentationPascal VOC blended 2012 (train)
mIoU79
96
Semantic segmentationCityscapes 1/4 (744 labels)
mIoU77.3
91
Semantic segmentationPASCAL VOC Augmented 2012
mIoU79
85
Semantic segmentationCityscapes 1/16 (186 labeled samples)
mIoU74.9
78
Semantic segmentationCITYSCAPES 1/8 labeled samples 372 labels (val)
mIoU76.4
65
Referring Expression SegmentationRefCOCOg UMD (val)
mIoU42.5
52
Semantic segmentationPascal VOC Original protocol 92 labeled images
mIoU70.2
48
Referring Expression SegmentationRefCOCOg UMD (test-u)
mIoU43.49
46
Showing 10 of 38 rows

Other info

Code

Follow for update