Modeling Inter-Class and Intra-Class Constraints in Novel Class Discovery
About
Novel class discovery (NCD) aims to learn a model that transfers common knowledge from a labelled dataset to a class-disjoint unlabelled dataset and discovers new classes (clusters) within it. Many methods, together with elaborate training pipelines and appropriate objectives, have been proposed and have considerably boosted performance on NCD tasks. Despite this, we find that existing methods do not take full advantage of the essence of the NCD setting. To this end, we propose to model both inter-class and intra-class constraints in NCD based on the symmetric Kullback-Leibler divergence (sKLD). Specifically, we propose an inter-class sKLD constraint that exploits the disjoint relationship between labelled and unlabelled classes, enforcing separability between different classes in the embedding space. In addition, we present an intra-class sKLD constraint that explicitly constrains the relationship between a sample and its augmentations while also stabilising the training process. We conduct extensive experiments on the popular CIFAR10, CIFAR100 and ImageNet benchmarks and demonstrate that our method establishes a new state of the art with significant performance improvements over previous state-of-the-art methods, e.g., 3.5%/3.7% clustering accuracy improvements on the CIFAR100-50 dataset split under the task-aware/-agnostic evaluation protocol. Code is available at https://github.com/FanZhichen/NCD-IIC.
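To make the two constraints concrete, the following is a minimal sketch of a symmetric KL divergence and of how it could serve as an intra-class term (minimised between two augmented views of a sample) and an inter-class term (maximised between labelled and unlabelled predictions). It is not the authors' implementation: the 1/2 weighting, the `eps` smoothing, the all-pairs comparison in `inter_class_loss`, and every function name are assumptions made for illustration.

```python
import math

def softmax(logits):
    """Turn a list of logits into a probability distribution."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def kl(p, q, eps=1e-8):
    """KL(p || q) for two discrete distributions; eps avoids log(0)."""
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))

def skld(p, q, eps=1e-8):
    """Symmetric KL divergence, assumed here to be the average of both directions."""
    return 0.5 * (kl(p, q, eps) + kl(q, p, eps))

def intra_class_loss(view_a, view_b):
    """Hypothetical intra-class term: mean sKLD between the predictions for
    a sample and for its augmentation. Minimising it pulls the views together."""
    return sum(skld(p, q) for p, q in zip(view_a, view_b)) / len(view_a)

def inter_class_loss(labelled, unlabelled):
    """Hypothetical inter-class term: negative mean sKLD over all
    labelled/unlabelled pairs. Minimising it pushes the two groups apart."""
    pairs = [(p, q) for p in labelled for q in unlabelled]
    return -sum(skld(p, q) for p, q in pairs) / len(pairs)

# Toy predictions over three classes.
p = softmax([2.0, 0.5, -1.0])
q = softmax([-1.0, 0.5, 2.0])
print(skld(p, p), skld(p, q))
```

In this sketch the intra-class term is zero when both views yield identical predictions, while the inter-class term grows more negative as labelled and unlabelled predictions diverge, which matches the separability goal stated in the abstract.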
Related benchmarks
| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Clustering | CIFAR10 unlabelled (train) | Clustering Accuracy | 99.1 | 14 |
| Clustering | ImageNet unlabelled (train) | Clustering Accuracy | 91.9 | 14 |
| Clustering | CIFAR100-20 unlabelled (train) | Clustering Accuracy | 92.4 | 13 |
| Generalized Category Discovery | CIFAR-10 (test) | -- | -- | 13 |
| Generalized Category Discovery | SoyAgeing R1 (train) | Overall Accuracy | 34.34 | 9 |
| Generalized Category Discovery | SoyAgeing-R3 (train) | Overall Accuracy | 34.55 | 9 |
| Generalized Category Discovery | SoyAgeing R6 (train) | All Categories Accuracy | 32.07 | 9 |
| Clustering | CIFAR100-50 unlabelled (train) | Clustering Accuracy | 65.8 | 8 |
| Generalized Category Discovery | CIFAR-100-20 (test) | Accuracy (Label) | 75.9 | 7 |
| Generalized Category Discovery | SoyAgeing-R4 (train) | Accuracy | 35.71 | 6 |
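
For readers unfamiliar with the clustering accuracy metric reported above, the sketch below shows how it is commonly computed: predicted cluster ids are arbitrary, so the score is taken under the best one-to-one mapping from clusters to ground-truth classes. This is an illustrative brute-force version and not the evaluation code behind these benchmark entries; the function name and the assumption that labels and cluster ids both lie in `0..num_classes-1` are hypothetical. Practical evaluations typically replace the permutation search with the Hungarian algorithm, since the brute-force search is factorial in the number of classes.

```python
from itertools import permutations

def clustering_accuracy(y_true, y_pred, num_classes):
    """Fraction of samples correctly labelled under the best one-to-one
    mapping of cluster ids to class ids (brute-force search, small k only)."""
    best_correct = 0
    for perm in permutations(range(num_classes)):
        # perm[c] is the class that cluster c is mapped to in this candidate.
        correct = sum(1 for t, c in zip(y_true, y_pred) if perm[c] == t)
        best_correct = max(best_correct, correct)
    return best_correct / len(y_true)

# Clusters are a pure relabelling of the classes, so accuracy is perfect.
print(clustering_accuracy([0, 0, 1, 1, 2, 2], [1, 1, 2, 2, 0, 0], 3))  # 1.0
```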