InfoSculpt: Sculpting the Latent Space for Generalized Category Discovery

About

Generalized Category Discovery (GCD) aims to classify instances from both known and novel categories within a large-scale unlabeled dataset, a critical yet challenging task for real-world, open-world applications. However, existing methods often rely on pseudo-labeling, or two-stage clustering, which lack a principled mechanism to explicitly disentangle essential, category-defining signals from instance-specific noise. In this paper, we address this fundamental limitation by re-framing GCD from an information-theoretic perspective, grounded in the Information Bottleneck (IB) principle. We introduce InfoSculpt, a novel framework that systematically sculpts the representation space by minimizing a dual Conditional Mutual Information (CMI) objective. InfoSculpt uniquely combines a Category-Level CMI on labeled data to learn compact and discriminative representations for known classes, and a complementary Instance-Level CMI on all data to distill invariant features by compressing augmentation-induced noise. These two objectives work synergistically at different scales to produce a disentangled and robust latent space where categorical information is preserved while noisy, instance-specific details are discarded. Extensive experiments on 8 benchmarks demonstrate that InfoSculpt validating the effectiveness of our information-theoretic approach.

Wenwen Liao, Hang Ruan, Jianbo Yu, Yuansong Wang, Qingchao Jiang, Xiaofeng Yang• 2026

Related benchmarks

Task	Dataset	Result
Generalized Category Discovery	CIFAR-100	Accuracy (All)82.2	268
Generalized Category Discovery	ImageNet-100	All Accuracy85.6	252
Generalized Category Discovery	Stanford Cars	Accuracy (All)59.5	228
Generalized Category Discovery	CUB	Accuracy (All)66.8	186
Generalized Category Discovery	CIFAR-10	All Accuracy97.4	152
Generalized Category Discovery	FGVC Aircraft	Accuracy (All)57	115
Generalized Category Discovery	Herbarium19	Score (All Categories)48.5	71
Generalized Category Discovery	ImageNet-1K	Accuracy (All)63	19

Showing 8 of 8 rows

Other info

Follow for update

@wizwand_team Discord