Unbiased Teacher v2: Semi-supervised Object Detection for Anchor-free and Anchor-based Detectors
About
With the recent development of Semi-Supervised Object Detection (SS-OD) techniques, object detectors can be improved by using a limited amount of labeled data and abundant unlabeled data. However, there are still two challenges that are not addressed: (1) there is no prior SS-OD work on anchor-free detectors, and (2) prior works are ineffective when pseudo-labeling bounding box regression. In this paper, we present Unbiased Teacher v2, which shows the generalization of SS-OD method to anchor-free detectors and also introduces Listen2Student mechanism for the unsupervised regression loss. Specifically, we first present a study examining the effectiveness of existing SS-OD methods on anchor-free detectors and find that they achieve much lower performance improvements under the semi-supervised setting. We also observe that box selection with centerness and the localization-based labeling used in anchor-free detectors cannot work well under the semi-supervised setting. On the other hand, our Listen2Student mechanism explicitly prevents misleading pseudo-labels in the training of bounding box regression; we specifically develop a novel pseudo-labeling selection mechanism based on the Teacher and Student's relative uncertainties. This idea contributes to favorable improvement in the regression branch in the semi-supervised setting. Our method, which works for both anchor-free and anchor-based methods, consistently performs favorably against the state-of-the-art methods in VOC, COCO-standard, and COCO-additional.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Object Detection | COCO 2017 (val) | AP44.75 | 2454 | |
| Object Detection | PASCAL VOC 2007 (test) | -- | 821 | |
| Object Detection | MS-COCO 2017 (val) | -- | 237 | |
| Text Detection | Total-Text (test) | F-Measure87.5 | 126 | |
| Text Detection | ICDAR 2015 (test) | F1 Score85.5 | 108 | |
| Scene Text Spotting | Total-Text (test) | F-measure (None)79.6 | 105 | |
| Object Detection | Pascal VOC | mAP81.29 | 88 | |
| Object Detection | COCO standard (5% labeled) | mAP31.85 | 70 | |
| Object Detection | COCO standard 2017 (train val) | AP (IoU 0.5:0.95)32.61 | 64 | |
| End-to-End Text Spotting | ICDAR 2015 (test) | Generic F-measure67.5 | 62 |