Parametric Information Maximization for Generalized Category Discovery
About
We introduce a Parametric Information Maximization (PIM) model for the Generalized Category Discovery (GCD) problem. Specifically, we propose a bi-level optimization formulation, which explores a parameterized family of objective functions, each evaluating a weighted mutual information between the features and the latent labels, subject to supervision constraints from the labeled samples. Our formulation mitigates the class-balance bias encoded in standard information maximization approaches, thereby handling effectively both short-tailed and long-tailed data sets. We report extensive experiments and comparisons demonstrating that our PIM model consistently sets new state-of-the-art performances in GCD across six different datasets, more so when dealing with challenging fine-grained problems.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Generalized Category Discovery | ImageNet-100 | All Accuracy83.1 | 138 | |
| Generalized Category Discovery | CIFAR-100 | Accuracy (All)78.3 | 133 | |
| Generalized Category Discovery | Stanford Cars | Accuracy (All)43.1 | 128 | |
| Generalized Category Discovery | CUB | Accuracy (All)62.7 | 113 | |
| Generalized Category Discovery | CIFAR-10 | All Accuracy94.7 | 105 | |
| Generalized Category Discovery | CUB-200 (test) | Overall Accuracy62.7 | 63 | |
| Generalized Category Discovery | Herbarium19 | Score (All Categories)42.3 | 47 | |
| Generalized Category Discovery | Herbarium19 (test) | Score (All Categories)42.3 | 37 | |
| Generalized Category Discovery | CIFAR10, CIFAR100, ImageNet-100, CUB, Stanford Cars, and Herbarium19 Average (test) | Accuracy (All)67.4 | 10 |