Tree Energy Loss: Towards Sparsely Annotated Semantic Segmentation

About

Sparsely annotated semantic segmentation (SASS) aims to train a segmentation network with coarse-grained (i.e., point-, scribble-, and block-wise) supervisions, where only a small proportion of pixels are labeled in each image. In this paper, we propose a novel tree energy loss for SASS by providing semantic guidance for unlabeled pixels. The tree energy loss represents images as minimum spanning trees to model both low-level and high-level pair-wise affinities. By sequentially applying these affinities to the network prediction, soft pseudo labels for unlabeled pixels are generated in a coarse-to-fine manner, achieving dynamic online self-training. The tree energy loss is effective and easy to be incorporated into existing frameworks by combining it with a traditional segmentation loss. Compared with previous SASS methods, our method requires no multistage training strategies, alternating optimization procedures, additional supervised data, or time-consuming post-processing while outperforming them in all SASS settings. Code is available at https://github.com/megvii-research/TreeEnergyLoss.

Zhiyuan Liang, Tiancai Wang, Xiangyu Zhang, Jian Sun, Jianbing Shen• 2022

Related benchmarks

Task	Dataset	Result
Semantic segmentation	ADE20K (val)	mIoU39.2	3069
Semantic segmentation	PASCAL VOC 2012 (val)	Mean IoU77.3	2204
Semantic segmentation	PASCAL VOC 2012 (test)	mIoU77.5	1477
Semantic segmentation	Cityscapes	mIoU64.9	668
Semantic segmentation	Cityscapes (val)	mIoU71.5	572
Camouflaged Object Detection	COD10K (test)	S-measure (S_alpha)0.727	306
Instance Segmentation	PASCAL VOC 2012 (val)	mAP @0.565	173
Semantic segmentation	PASCAL VOC 2012 (val)	mIoU77.1	166
Semantic segmentation	Pascal VOC augmented 2012 (val)	mIoU76.2	162
Camouflaged Object Detection	CAMO (test)	M0.133	154

Showing 10 of 29 rows

Other info

Code

Follow for update

@wizwand_team Discord