Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Customized Multiple Clustering via Multi-Modal Subspace Proxy Learning

About

Multiple clustering aims to discover various latent structures of data from different aspects. Deep multiple clustering methods have achieved remarkable performance by exploiting complex patterns and relationships in data. However, existing works struggle to flexibly adapt to diverse user-specific needs in data grouping, which may require manual understanding of each clustering. To address these limitations, we introduce Multi-Sub, a novel end-to-end multiple clustering approach that incorporates a multi-modal subspace proxy learning framework in this work. Utilizing the synergistic capabilities of CLIP and GPT-4, Multi-Sub aligns textual prompts expressing user preferences with their corresponding visual representations. This is achieved by automatically generating proxy words from large language models that act as subspace bases, thus allowing for the customized representation of data in terms specific to the user's interests. Our method consistently outperforms existing baselines across a broad set of datasets in visual multiple clustering tasks. Our code is available at https://github.com/Alexander-Yao/Multi-Sub.

Jiawei Yao, Qi Qian, Juhua Hu• 2024

Related benchmarks

TaskDatasetResultRank
Image ClusteringCIFAR-10
NMI0.5271
243
Clustering (Species)Fruit360
NMI61.23
24
Clustering (Color)Fruit360
NMI0.6654
24
Clustering (Color)Stanford Cars
NMI75.33
24
Clustering (Color)Flowers
NMI69.4
24
Clustering (Glass)CMUface
NMI0.487
24
Clustering (Pose)CMUface
NMI0.5923
24
ClusteringFruit Color
NMI0.9693
16
Clustering (Identity)CMUface
NMI0.7441
16
Clustering (Order)Card
NMI0.3921
16
Showing 10 of 18 rows

Other info

Code

Follow for update