Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Generalized Category Discovery

About

In this paper, we consider a highly general image recognition setting wherein, given a labelled and unlabelled set of images, the task is to categorize all images in the unlabelled set. Here, the unlabelled images may come from labelled classes or from novel ones. Existing recognition methods are not able to deal with this setting, because they make several restrictive assumptions, such as the unlabelled instances only coming from known - or unknown - classes, and the number of unknown classes being known a-priori. We address the more unconstrained setting, naming it 'Generalized Category Discovery', and challenge all these assumptions. We first establish strong baselines by taking state-of-the-art algorithms from novel category discovery and adapting them for this task. Next, we propose the use of vision transformers with contrastive representation learning for this open-world setting. We then introduce a simple yet effective semi-supervised $k$-means method to cluster the unlabelled data into seen and unseen classes automatically, substantially outperforming the baselines. Finally, we also propose a new approach to estimate the number of classes in the unlabelled data. We thoroughly evaluate our approach on public datasets for generic object classification and on fine-grained datasets, leveraging the recent Semantic Shift Benchmark suite. Project page at https://www.robots.ox.ac.uk/~vgg/research/gcd

Sagar Vaze, Kai Han, Andrea Vedaldi, Andrew Zisserman• 2022

Related benchmarks

TaskDatasetResultRank
Generalized Category DiscoveryImageNet-100
All Accuracy81.3
138
Generalized Category DiscoveryCIFAR-100
Accuracy (All)79.6
133
Generalized Category DiscoveryStanford Cars
Accuracy (All)65.7
128
Generalized Category DiscoveryCUB
Accuracy (All)71.9
113
Generalized Category DiscoveryCIFAR-10
All Accuracy97.8
105
Generalized Category DiscoveryFGVC Aircraft
Accuracy (All)55.4
82
Generalized Category DiscoveryCUB-200 (test)
Overall Accuracy71.9
63
Anomaly ClassificationMVTec-AD (test)--
50
Generalized Category DiscoveryHerbarium19
Score (All Categories)35.4
47
Open-world semi-supervised learningCIFAR-100 (test)
Overall Accuracy46.8
40
Showing 10 of 80 rows
...

Other info

Code

Follow for update