Learning Representation for Clustering via Prototype Scattering and Positive Sampling

About

Existing deep clustering methods rely on either contrastive or non-contrastive representation learning for the downstream clustering task. Contrastive methods learn uniform representations thanks to negative pairs, but those negative pairs may inevitably lead to the class collision issue and consequently compromise clustering performance. Non-contrastive methods, on the other hand, avoid the class collision issue, but the resulting non-uniform representations may cause clustering to collapse. To combine the strengths of both, this paper presents a novel end-to-end deep clustering method with prototype scattering and positive sampling, termed ProPos. First, a prototype scattering loss maximizes the distance between prototypical representations, which improves the uniformity of the representations. Second, positive sampling alignment aligns one augmented view of an instance with the sampled neighbors of another view (assumed to form truly positive pairs in the embedding space), which improves within-cluster compactness. The strengths of ProPos are the avoidance of class collision, uniform representations, well-separated clusters, and within-cluster compactness. With ProPos optimized in an end-to-end expectation-maximization framework, extensive experimental results demonstrate that it achieves competitive performance on moderate-scale clustering benchmark datasets and establishes new state-of-the-art performance on large-scale datasets. Source code is available at https://github.com/Hzzone/ProPos.
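The two objectives named in the abstract can be illustrated with a small NumPy sketch. This is not the authors' implementation (see the repository linked above for that). It assumes L2-normalized embeddings, a log-sum-exp penalty on pairwise prototype similarities as the scattering term, and Gaussian noise of scale sigma as the neighbor sampler. The function names, the temperature, and sigma are hypothetical choices made for illustration only.

```python
import numpy as np


def l2_normalize(x, eps=1e-12):
    """Project rows of x onto the unit sphere."""
    return x / (np.linalg.norm(x, axis=-1, keepdims=True) + eps)


def cluster_prototypes(z, labels, k):
    """Mean embedding per cluster, re-normalized.

    Assumes every cluster index in range(k) has at least one member.
    """
    protos = np.stack([z[labels == c].mean(axis=0) for c in range(k)])
    return l2_normalize(protos)


def prototype_scattering_loss(protos, temperature=0.5):
    """Illustrative scattering term (an assumption, not the paper's formula).

    For each prototype, take the log-sum-exp of its scaled cosine
    similarities to all other prototypes. The value drops as the
    prototypes move apart on the sphere.
    """
    k = protos.shape[0]
    sim = protos @ protos.T / temperature
    # Keep only the k - 1 off-diagonal similarities in each row.
    off_diag = sim[~np.eye(k, dtype=bool)].reshape(k, k - 1)
    row_max = off_diag.max(axis=1, keepdims=True)
    lse = row_max[:, 0] + np.log(np.exp(off_diag - row_max).sum(axis=1))
    return float(lse.mean())


def positive_sampling_alignment(z1, z2, sigma, rng):
    """Illustrative alignment term (an assumption, not the paper's formula).

    Samples a neighbor of each view-1 embedding by adding Gaussian noise,
    then measures the mean squared distance to the view-2 embedding.
    """
    neighbors = l2_normalize(z1 + sigma * rng.standard_normal(z1.shape))
    return float(np.mean(np.sum((neighbors - z2) ** 2, axis=1)))
```

In this sketch, mutually orthogonal prototypes score lower on the scattering term than nearly collinear ones, and the alignment term is zero when both views coincide and sigma is 0. In an expectation-maximization style loop, cluster assignments (and hence prototypes) would be refreshed periodically while the encoder is trained on the sum of the two terms.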

Zhizhong Huang, Jie Chen, Junping Zhang, Hongming Shan • 2021

Related benchmarks

Task	Dataset	Metric	Result
Image Classification	ImageNet-1K	Top-1 Acc	72.2	600
Image Clustering	CIFAR-10	NMI	0.886	318
Image Clustering	STL-10	ACC	86.7	282
Image Clustering	ImageNet-10	NMI	0.896	220
Clustering	CIFAR-10 (test)	Accuracy	94.3	190
Clustering	STL-10 (test)	Accuracy	86.7	152
Clustering	Imagenet Dogs	NMI	73.7	105
Clustering	CIFAR100 20	ACC	61.4	93
Clustering	CIFAR-100-20 (test)	Accuracy	61.4	68
Grouping	Imagenet Dogs	ACC	74.5	59

Showing 10 of 20 rows
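For readers unfamiliar with the two clustering metrics in the table, the sketch below shows one common way to compute them. It is not taken from the paper or this page. It assumes NMI is normalized by the arithmetic mean of the two label entropies, and that ACC is the accuracy under the best one-to-one mapping from clusters to classes. The mapping is found here by brute force, which is only practical for a handful of clusters (the Hungarian algorithm is the usual choice at scale). Function names are hypothetical.

```python
import math
from collections import Counter
from itertools import permutations


def normalized_mutual_info(labels_true, labels_pred):
    """NMI with arithmetic-mean normalization (an assumed convention)."""
    n = len(labels_true)
    count_true = Counter(labels_true)
    count_pred = Counter(labels_pred)
    count_joint = Counter(zip(labels_true, labels_pred))
    mi = sum(
        (c / n) * math.log(c * n / (count_true[t] * count_pred[p]))
        for (t, p), c in count_joint.items()
    )
    h_true = -sum((c / n) * math.log(c / n) for c in count_true.values())
    h_pred = -sum((c / n) * math.log(c / n) for c in count_pred.values())
    denom = (h_true + h_pred) / 2
    # Both labelings constant: treat them as trivially identical.
    return mi / denom if denom > 0 else 1.0


def clustering_accuracy(labels_true, labels_pred):
    """Accuracy under the best injective cluster-to-class mapping.

    Assumes there are no more predicted clusters than true classes.
    """
    classes = sorted(set(labels_true))
    clusters = sorted(set(labels_pred))
    best = 0
    for perm in permutations(classes, len(clusters)):
        mapping = dict(zip(clusters, perm))
        hits = sum(mapping[p] == t for t, p in zip(labels_true, labels_pred))
        best = max(best, hits)
    return best / len(labels_true)
```

Both functions return values in [0, 1]; multiplying by 100 gives the percentage form that some rows of the table appear to use.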
