SEAL: Semantic-Aware Hierarchical Learning for Generalized Category Discovery
About
This paper investigates the problem of Generalized Category Discovery (GCD). Given a partially labelled dataset, GCD aims to categorize all unlabelled images, regardless of whether they belong to known or unknown classes. Existing approaches typically depend on either single-level semantics or manually designed abstract hierarchies, which limit their generalizability and scalability. To address these limitations, we introduce a SEmantic-aware hierArchical Learning framework (SEAL), guided by naturally occurring and easily accessible hierarchical structures. Within SEAL, we propose a Hierarchical Semantic-Guided Soft Contrastive Learning approach that exploits hierarchical similarity to generate informative soft negatives, addressing the limitations of conventional contrastive losses that treat all negatives equally. Furthermore, a Cross-Granularity Consistency (CGC) module is designed to align the predictions from different levels of granularity. SEAL consistently achieves state-of-the-art performance on fine-grained benchmarks, including the SSB benchmark, Oxford-Pet, and the Herbarium19 dataset, and further demonstrates generalization on coarse-grained datasets. Project page: https://visual-ai.github.io/seal/
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Generalized Category Discovery | CIFAR-100 | Accuracy (All)89.8 | 233 | |
| Generalized Category Discovery | Stanford Cars | Accuracy (All)77.7 | 208 | |
| Generalized Category Discovery | CUB | Accuracy (All)76.7 | 186 | |
| Generalized Category Discovery | CIFAR-10 | All Accuracy98.9 | 152 | |
| Generalized Category Discovery | SSB Average | Accuracy (All)76.3 | 33 | |
| Generalized Category Discovery | CIFAR10, CIFAR100, ImageNet-100 | Accuracy (All)93.3 | 20 |