Guided Distillation for Semi-Supervised Instance Segmentation
About
Although instance segmentation methods have improved considerably, the dominant paradigm is to rely on fully-annotated training images, which are tedious to obtain. To alleviate this reliance, and boost results, semi-supervised approaches leverage unlabeled data as an additional training signal that limits overfitting to the labeled samples. In this context, we present novel design choices to significantly improve teacher-student distillation models. In particular, we (i) improve the distillation approach by introducing a novel "guided burn-in" stage, and (ii) evaluate different instance segmentation architectures, as well as backbone networks and pre-training strategies. Contrary to previous work which uses only supervised data for the burn-in period of the student model, we also use guidance of the teacher model to exploit unlabeled data in the burn-in period. Our improved distillation approach leads to substantial improvements over previous state-of-the-art results. For example, on the Cityscapes dataset we improve mask-AP from 23.7 to 33.9 when using labels for 10% of images, and on the COCO dataset we improve mask-AP from 18.3 to 34.1 when using labels for only 1% of the training data.
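The core mechanics can be sketched in a few lines: the teacher is an exponential moving average (EMA) of the student, and the "guided burn-in" differs from a standard burn-in only in that the teacher's pseudo-label loss on unlabeled images is applied from the start rather than being switched off. This is a minimal illustrative sketch, not the paper's actual code; the function names (`ema_update`, `loss_weights`), the decay value, and the 0/1 weighting schedule are all assumptions for illustration.

```python
def ema_update(teacher, student, decay=0.999):
    """Teacher parameters track the student as an exponential moving
    average. Parameters are represented here as plain dicts of floats;
    in practice they would be model state_dicts."""
    return {k: decay * teacher[k] + (1.0 - decay) * student[k] for k in teacher}


def loss_weights(step, burn_in_steps, guided=True):
    """Return (supervised_weight, pseudo_label_weight) for a training step.

    Standard burn-in: the student sees only the supervised loss until
    `burn_in_steps`, then the pseudo-label loss on unlabeled data kicks in.
    Guided burn-in (illustrative reading of the paper): the teacher's
    pseudo-label loss is applied during the burn-in period as well.
    """
    if step < burn_in_steps and not guided:
        return 1.0, 0.0  # supervised-only burn-in
    return 1.0, 1.0      # supervised + teacher-guided pseudo-label loss


# Example: at step 0 of a 1000-step burn-in, a standard schedule ignores
# unlabeled data, while the guided schedule already uses teacher guidance.
print(loss_weights(0, 1000, guided=False))  # (1.0, 0.0)
print(loss_weights(0, 1000, guided=True))   # (1.0, 1.0)
```

In a full training loop, the total loss at each step would be `sup_w * supervised_loss + pseudo_w * pseudo_label_loss`, with `ema_update` applied to the teacher after every student update.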
Related benchmarks
| Task | Dataset | Result | Rank |
|---|---|---|---|
| Instance Segmentation | Cityscapes (val) | 43.8 AP | 239 |
| Instance Segmentation | OrgaSegment (test) | 32.6 mAP | 24 |
| Instance Segmentation | M-OrgaQuant (test) | 60.5 mAP | 24 |
| Instance Segmentation | Cityscapes 10% labeled data (val) | 30.8 mask AP | 11 |
| Instance Segmentation | ADE20K 10% labeled data (val) | 14.2 mask AP | 11 |
| Instance Segmentation | COCO 1% labels (val) | 34.1 AP | 7 |
| Instance Segmentation | COCO 2% labels (val) | 38.2 AP | 7 |
| Instance Segmentation | COCO 5% labels (val) | 41.6 AP | 7 |
| Instance Segmentation | COCO 10% labels (val) | 43.1 AP | 7 |
| Instance Segmentation | COCO (0.1% labeled data) | 13.6 mask AP | 4 |