Neighborhood Contrastive Learning for Novel Class Discovery
About
In this paper, we address Novel Class Discovery (NCD), the task of unveiling new classes in a set of unlabeled samples given a labeled dataset with known classes. We exploit the peculiarities of NCD to build a new framework, named Neighborhood Contrastive Learning (NCL), to learn discriminative representations that are important to clustering performance. Our contribution is twofold. First, we find that a feature extractor trained on the labeled set generates representations in which a generic query sample and its neighbors are likely to share the same class. We exploit this observation to retrieve and aggregate pseudo-positive pairs with contrastive learning, thus encouraging the model to learn more discriminative representations. Second, we notice that most of the instances are easily discriminated by the network, contributing less to the contrastive loss. To overcome this issue, we propose to generate hard negatives by mixing labeled and unlabeled samples in the feature space. We experimentally demonstrate that these two ingredients significantly contribute to clustering performance and lead our model to outperform state-of-the-art methods by a large margin (e.g., clustering accuracy +13% on CIFAR-100 and +8% on ImageNet).
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Novel Class Discovery | CIFAR-10 (unlabelled set) | Clustering Accuracy93.4 | 21 | |
| Novel Class Discovery | CIFAR-100 (unlabelled set) | Clustering Accuracy86.6 | 21 | |
| Novel Class Discovery | CIFAR-100 | ACC (Seen)0.5313 | 19 | |
| Clustering | ImageNet unlabelled (train) | Clustering Accuracy90.7 | 14 | |
| Clustering | CIFAR10 unlabelled (train) | Clustering Accuracy93.4 | 14 | |
| Clustering | CIFAR100-20 unlabelled (train) | Clustering Accuracy86.6 | 13 | |
| Novel Class Discovery | ImageNet (unlabelled set) | Clustering Accuracy90.7 | 12 | |
| Novel Class Discovery | ISIC Task2 2019 | Accuracy47.62 | 7 | |
| Novel Class Discovery | Tiny-ImageNet | Accuracy (Seen)35.27 | 7 | |
| Novel Class Discovery | ISIC Task1 2019 | ACC59.41 | 7 |