Tree Energy Loss: Towards Sparsely Annotated Semantic Segmentation
About
Sparsely annotated semantic segmentation (SASS) aims to train a segmentation network with coarse-grained (i.e., point-, scribble-, and block-wise) supervisions, where only a small proportion of pixels are labeled in each image. In this paper, we propose a novel tree energy loss for SASS by providing semantic guidance for unlabeled pixels. The tree energy loss represents images as minimum spanning trees to model both low-level and high-level pair-wise affinities. By sequentially applying these affinities to the network prediction, soft pseudo labels for unlabeled pixels are generated in a coarse-to-fine manner, achieving dynamic online self-training. The tree energy loss is effective and easy to be incorporated into existing frameworks by combining it with a traditional segmentation loss. Compared with previous SASS methods, our method requires no multistage training strategies, alternating optimization procedures, additional supervised data, or time-consuming post-processing while outperforming them in all SASS settings. Code is available at https://github.com/megvii-research/TreeEnergyLoss.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Semantic segmentation | ADE20K (val) | mIoU39.2 | 2731 | |
| Semantic segmentation | PASCAL VOC 2012 (val) | Mean IoU77.3 | 2040 | |
| Semantic segmentation | PASCAL VOC 2012 (test) | mIoU77.5 | 1342 | |
| Semantic segmentation | Cityscapes (val) | mIoU71.5 | 572 | |
| Camouflaged Object Detection | COD10K (test) | S-measure (S_alpha)0.727 | 174 | |
| Instance Segmentation | PASCAL VOC 2012 (val) | mAP @0.565 | 173 | |
| Semantic segmentation | Pascal VOC augmented 2012 (val) | mIoU76.2 | 162 | |
| Semantic segmentation | Pascal VOC 21 classes (val) | mIoU77.1 | 103 | |
| Polyp Segmentation | ETIS (test) | Mean Dice8.3 | 86 | |
| Camouflaged Object Detection | CAMO (test) | S_alpha0.717 | 85 |