Reference-based Category Discovery: Unsupervised Object Detection with Category Awareness
About
Traditional one-shot detection methods have addressed the closed-set problem in object detection, but the high cost of data annotation remains a critical challenge. General unsupervised methods generate pseudo boxes without category labels, thus failing to achieve category-aware classification. To overcome these limitations, we propose Reference-based Category Discovery (RefCD), an unsupervised detector that enables category-aware\footnotemark[1] detection without any manually annotated labels. It leverages feature similarity between predicted objects and unlabeled reference images. Unlike previous unsupervised methods that lack category guidance and one-shot methods which require labeled data, RefCD introduces a carefully designed feature similarity loss to explicitly guide the learning of potential category-specific features. Additionally, RefCD supports category-agnostic detection without reference images, serving as a unified framework. Comprehensive quantitative and qualitative analysis of category-aware and category-agnostic detection results demonstrates its effectiveness, and RefCD can learn category information in an unsupervised paradigm even without category labels.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Object Detection | COCO 2017 (val) | AP10.6 | 2843 | |
| Object Detection | COCO (novel) | AP (Novel)17.7 | 57 | |
| Object Tracking | VOT 2018 | EAO0.187 | 22 | |
| Category-agnostic object detection | COCO 20k | AP13.3 | 8 | |
| One-shot Object Detection | COCO (novel) | AP15.1 | 8 | |
| Category-agnostic object detection | COCO 2017 (val) | AP0.129 | 7 | |
| One-shot Object Detection | GMOT-40 | AP12.9 | 4 |