Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Hierarchical Textual Knowledge for Enhanced Image Clustering

About

Image clustering aims to group images in an unsupervised fashion. Traditional methods focus on knowledge from visual space, making it difficult to distinguish between visually similar but semantically different classes. Recent advances in vision-language models enable the use of textual knowledge to enhance image clustering. However, most existing methods rely on coarse class labels or simple nouns, overlooking the rich conceptual and attribute-level semantics embedded in textual space. In this paper, we propose a knowledge-enhanced clustering (KEC) method that constructs a hierarchical concept-attribute structured knowledge with the help of large language models (LLMs) to guide clustering. Specifically, we first condense redundant textual labels into abstract concepts and then automatically extract discriminative attributes for each single concept and similar concept pairs, via structured prompts to LLMs. This knowledge is instantiated for each input image to achieve the knowledge-enhanced features. The knowledge-enhanced features with original visual features are adapted to various downstream clustering algorithms. We evaluate KEC on 20 diverse datasets, showing consistent improvements across existing methods using additional textual knowledge. KEC without training outperforms zero-shot CLIP on 14 out of 20 datasets. Furthermore, the naive use of textual knowledge may harm clustering performance, while KEC provides both accuracy and robustness.

Yijie Zhong, Yunfan Gao, Weipeng Jiang, Haofen Wang• 2026

Related benchmarks

TaskDatasetResultRank
Image ClusteringCIFAR-10
NMI0.841
318
Image ClusteringSTL-10
ACC98.5
282
Image ClusteringCIFAR-100
ACC57.1
111
Image ClusteringDTD
NMI63.1
49
ClusteringPets
NMI84.8
21
ClusteringFlowers
NMI (%)92.4
17
ClusteringImageNet
Accuracy55.3
15
Image Clustering20 datasets (average)
NMI58.4
11
Image ClusteringCLEVR
NMI18.8
11
Image ClusteringMNIST
NMI47.8
10
Showing 10 of 10 rows

Other info

Follow for update