Bootstrapping Semantic Segmentation with Regional Contrast
About
We present ReCo, a contrastive learning framework designed at a regional level to assist learning in semantic segmentation. ReCo performs semi-supervised or supervised pixel-level contrastive learning on a sparse set of hard negative pixels, with minimal additional memory footprint. ReCo is easy to implement, being built on top of off-the-shelf segmentation networks, and consistently improves performance in both semi-supervised and supervised semantic segmentation methods, achieving smoother segmentation boundaries and faster convergence. The strongest effect is in semi-supervised learning with very few labels. With ReCo, we achieve high-quality semantic segmentation models, requiring only 5 examples of each semantic class. Code is available at https://github.com/lorenmt/reco.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Semantic segmentation | PASCAL VOC 2012 (val) | Mean IoU74.7 | 2040 | |
| Semantic segmentation | PASCAL VOC (val) | mIoU77.75 | 338 | |
| Semantic segmentation | Pascal VOC Original protocol 92 labeled images | mIoU64.8 | 48 | |
| Semantic segmentation | Pascal VOC 183 labeled images (Original protocol) | mIoU72 | 34 | |
| Semantic segmentation | Pascal VOC 366 labeled images (Original protocol) | mIoU73.1 | 34 | |
| Semantic segmentation | Pascal VOC Original protocol 732 labeled images | mIoU74.7 | 34 | |
| Semantic segmentation | Cityscapes (val) | mIoU70.63 | 33 | |
| Semantic segmentation | Pascal VOC PseudoSeg setting (1.4k images) (val) | mIoU74.69 | 32 | |
| Semantic segmentation | PASCAL VOC Original (1/8 partition) 2012 (train) | mIoU72 | 10 | |
| Semantic segmentation | PASCAL VOC Original (1/16 partition) 2012 (train) | mIoU64.8 | 10 |