Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

InfoSculpt: Sculpting the Latent Space for Generalized Category Discovery

About

Generalized Category Discovery (GCD) aims to classify instances from both known and novel categories within a large-scale unlabeled dataset, a critical yet challenging task for real-world, open-world applications. However, existing methods often rely on pseudo-labeling, or two-stage clustering, which lack a principled mechanism to explicitly disentangle essential, category-defining signals from instance-specific noise. In this paper, we address this fundamental limitation by re-framing GCD from an information-theoretic perspective, grounded in the Information Bottleneck (IB) principle. We introduce InfoSculpt, a novel framework that systematically sculpts the representation space by minimizing a dual Conditional Mutual Information (CMI) objective. InfoSculpt uniquely combines a Category-Level CMI on labeled data to learn compact and discriminative representations for known classes, and a complementary Instance-Level CMI on all data to distill invariant features by compressing augmentation-induced noise. These two objectives work synergistically at different scales to produce a disentangled and robust latent space where categorical information is preserved while noisy, instance-specific details are discarded. Extensive experiments on 8 benchmarks demonstrate that InfoSculpt validating the effectiveness of our information-theoretic approach.

Wenwen Liao, Hang Ruan, Jianbo Yu, Yuansong Wang, Qingchao Jiang, Xiaofeng Yang• 2026

Related benchmarks

TaskDatasetResultRank
Generalized Category DiscoveryImageNet-100
All Accuracy85.6
138
Generalized Category DiscoveryCIFAR-100
Accuracy (All)82.2
133
Generalized Category DiscoveryStanford Cars
Accuracy (All)59.5
128
Generalized Category DiscoveryCUB
Accuracy (All)66.8
113
Generalized Category DiscoveryCIFAR-10
All Accuracy97.4
105
Generalized Category DiscoveryFGVC Aircraft
Accuracy (All)57
82
Generalized Category DiscoveryHerbarium19
Score (All Categories)48.5
47
Generalized Category DiscoveryImageNet-1K
Accuracy (All)63
19
Showing 8 of 8 rows

Other info

Follow for update