CiPR: An Efficient Framework with Cross-instance Positive Relations for Generalized Category Discovery
About
We tackle the issue of generalized category discovery (GCD). GCD considers the open-world problem of automatically clustering a partially labelled dataset, in which the unlabelled data may contain instances from both novel categories and labelled classes. In this paper, we address the GCD problem with an unknown category number for the unlabelled data. We propose a framework, named CiPR, to bootstrap the representation by exploiting Cross-instance Positive Relations in the partially labelled data for contrastive learning, which have been neglected in existing methods. To obtain reliable cross-instance relations to facilitate representation learning, we introduce a semi-supervised hierarchical clustering algorithm, named selective neighbor clustering (SNC), which can produce a clustering hierarchy directly from the connected components of a graph constructed from selective neighbors. We further present a method to estimate the unknown class number using SNC with a joint reference score that considers clustering indexes of both labelled and unlabelled data, and extend SNC to allow label assignment for the unlabelled instances with a given class number. We thoroughly evaluate our framework on public generic image recognition datasets and challenging fine-grained datasets, and establish a new state-of-the-art. Code: https://github.com/haoosz/CiPR
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Generalized Category Discovery | ImageNet-100 | All Accuracy88.2 | 138 | |
| Generalized Category Discovery | CIFAR-100 | Accuracy (All)90.3 | 133 | |
| Generalized Category Discovery | Stanford Cars | Accuracy (All)66.7 | 128 | |
| Generalized Category Discovery | CUB | Accuracy (All)78.3 | 113 | |
| Generalized Category Discovery | CIFAR-10 | All Accuracy99 | 105 | |
| Generalized Category Discovery | CUB-200 (test) | Overall Accuracy57.1 | 63 | |
| Fine-grained object category discovery | Stanford Cars (test) | Accuracy47 | 38 |