Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

GATCluster: Self-Supervised Gaussian-Attention Network for Image Clustering

About

We propose a self-supervised Gaussian ATtention network for image Clustering (GATCluster). Rather than extracting intermediate features first and then performing the traditional clustering algorithm, GATCluster directly outputs semantic cluster labels without further post-processing. Theoretically, we give a Label Feature Theorem to guarantee the learned features are one-hot encoded vectors, and the trivial solutions are avoided. To train the GATCluster in a completely unsupervised manner, we design four self-learning tasks with the constraints of transformation invariance, separability maximization, entropy analysis, and attention mapping. Specifically, the transformation invariance and separability maximization tasks learn the relationships between sample pairs. The entropy analysis task aims to avoid trivial solutions. To capture the object-oriented semantics, we design a self-supervised attention mechanism that includes a parameterized attention module and a soft-attention loss. All the guiding signals for clustering are self-generated during the training process. Moreover, we develop a two-step learning algorithm that is memory-efficient for clustering large-size images. Extensive experiments demonstrate the superiority of our proposed method in comparison with the state-of-the-art image clustering benchmarks. Our code has been made publicly available at https://github.com/niuchuangnn/GATCluster.

Chuang Niu, Jun Zhang, Ge Wang, Jimin Liang• 2020

Related benchmarks

TaskDatasetResultRank
Image ClusteringCIFAR-10
NMI0.475
243
Image ClusteringSTL-10
ACC58.3
229
ClusteringCIFAR-10 (test)
Accuracy61
184
Image ClusteringImageNet-10
NMI0.609
166
ClusteringSTL-10 (test)
Accuracy58.3
146
ClusteringCIFAR-100 (test)
ACC28.1
110
Image ClusteringCIFAR-100
ACC28.1
101
ClusteringCIFAR100 20
ACC0.281
93
Image ClusteringImagenet dog-15
NMI32.2
90
ClusteringImageNet-10 (test)
ACC73.9
69
Showing 10 of 14 rows

Other info

Follow for update