Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Diverse Cotraining Makes Strong Semi-Supervised Segmentor

About

Deep co-training has been introduced to semi-supervised segmentation and achieves impressive results, yet few studies have explored the working mechanism behind it. In this work, we revisit the core assumption that supports co-training: multiple compatible and conditionally independent views. By theoretically deriving the generalization upper bound, we prove the prediction similarity between two models negatively impacts the model's generalization ability. However, most current co-training models are tightly coupled together and violate this assumption. Such coupling leads to the homogenization of networks and confirmation bias which consequently limits the performance. To this end, we explore different dimensions of co-training and systematically increase the diversity from the aspects of input domains, different augmentations and model architectures to counteract homogenization. Our Diverse Co-training outperforms the state-of-the-art (SOTA) methods by a large margin across different evaluation protocols on the Pascal and Cityscapes. For example. we achieve the best mIoU of 76.2%, 77.7% and 80.2% on Pascal with only 92, 183 and 366 labeled images, surpassing the previous best results by more than 5%.

Yijiang Li, Xinjiang Wang, Lihe Yang, Litong Feng, Wayne Zhang, Ying Gao• 2023

Related benchmarks

TaskDatasetResultRank
Semantic segmentationPascal VOC (Original set)
mIoU81.6
105
Semantic segmentationCityscapes 1/4 (744 labels)
mIoU78.5
80
Semantic segmentationCityscapes 1/16 (186 labeled samples)
mIoU75.7
68
Semantic segmentationCITYSCAPES 1/8 labeled samples 372 labels (val)
mIoU77.4
65
Semantic segmentationPascal VOC 1/16 labeled 2012 (train)
mIoU75.7
53
Semantic segmentationPascal VOC Original protocol 92 labeled images
mIoU75.7
48
Semantic segmentationPascal VOC Original protocol 1464 labeled images
mIoU82
36
Semantic segmentationPascal VOC 366 labeled images (Original protocol)
mIoU80.1
34
Semantic segmentationPascal VOC Original protocol 732 labeled images
mIoU80.9
34
Semantic segmentationPascal VOC 183 labeled images (Original protocol)
mIoU77.7
34
Showing 10 of 23 rows

Other info

Follow for update