Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Learning Semi-supervised Gaussian Mixture Models for Generalized Category Discovery

About

In this paper, we address the problem of generalized category discovery (GCD), \ie, given a set of images where part of them are labelled and the rest are not, the task is to automatically cluster the images in the unlabelled data, leveraging the information from the labelled data, while the unlabelled data contain images from the labelled classes and also new ones. GCD is similar to semi-supervised learning (SSL) but is more realistic and challenging, as SSL assumes all the unlabelled images are from the same classes as the labelled ones. We also do not assume the class number in the unlabelled data is known a-priori, making the GCD problem even harder. To tackle the problem of GCD without knowing the class number, we propose an EM-like framework that alternates between representation learning and class number estimation. We propose a semi-supervised variant of the Gaussian Mixture Model (GMM) with a stochastic splitting and merging mechanism to dynamically determine the prototypes by examining the cluster compactness and separability. With these prototypes, we leverage prototypical contrastive learning for representation learning on the partially labelled data subject to the constraints imposed by the labelled data. Our framework alternates between these two steps until convergence. The cluster assignment for an unlabelled instance can then be retrieved by identifying its nearest prototype. We comprehensively evaluate our framework on both generic image classification datasets and challenging fine-grained object recognition datasets, achieving state-of-the-art performance.

Bingchen Zhao, Xin Wen, Kai Han• 2023

Related benchmarks

TaskDatasetResultRank
Generalized Category DiscoveryImageNet-100
All Accuracy76.9
138
Generalized Category DiscoveryCIFAR-100
Accuracy (All)77.9
133
Generalized Category DiscoveryStanford Cars
Accuracy (All)42.8
128
Generalized Category DiscoveryCUB
Accuracy (All)55.4
113
Generalized Category DiscoveryCIFAR-10
All Accuracy92.2
105
Generalized Category DiscoveryFGVC Aircraft
Accuracy (All)46.3
82
Fine-grained Image ClassificationFGVC Aircraft
Accuracy (All)43.3
39
Fine-grained Image ClassificationCUB-200
Accuracy (All)52
32
Showing 8 of 8 rows

Other info

Follow for update