InfoSculpt: Sculpting the Latent Space for Generalized Category Discovery
About
Generalized Category Discovery (GCD) aims to classify instances from both known and novel categories within a large-scale unlabeled dataset, a critical yet challenging task for real-world, open-world applications. However, existing methods often rely on pseudo-labeling, or two-stage clustering, which lack a principled mechanism to explicitly disentangle essential, category-defining signals from instance-specific noise. In this paper, we address this fundamental limitation by re-framing GCD from an information-theoretic perspective, grounded in the Information Bottleneck (IB) principle. We introduce InfoSculpt, a novel framework that systematically sculpts the representation space by minimizing a dual Conditional Mutual Information (CMI) objective. InfoSculpt uniquely combines a Category-Level CMI on labeled data to learn compact and discriminative representations for known classes, and a complementary Instance-Level CMI on all data to distill invariant features by compressing augmentation-induced noise. These two objectives work synergistically at different scales to produce a disentangled and robust latent space where categorical information is preserved while noisy, instance-specific details are discarded. Extensive experiments on 8 benchmarks demonstrate that InfoSculpt validating the effectiveness of our information-theoretic approach.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Generalized Category Discovery | ImageNet-100 | All Accuracy85.6 | 138 | |
| Generalized Category Discovery | CIFAR-100 | Accuracy (All)82.2 | 133 | |
| Generalized Category Discovery | Stanford Cars | Accuracy (All)59.5 | 128 | |
| Generalized Category Discovery | CUB | Accuracy (All)66.8 | 113 | |
| Generalized Category Discovery | CIFAR-10 | All Accuracy97.4 | 105 | |
| Generalized Category Discovery | FGVC Aircraft | Accuracy (All)57 | 82 | |
| Generalized Category Discovery | Herbarium19 | Score (All Categories)48.5 | 47 | |
| Generalized Category Discovery | ImageNet-1K | Accuracy (All)63 | 19 |