Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Asynchronous Federated Clustering with Unknown Number of Clusters

About

Federated Clustering (FC) is crucial to mining knowledge from unlabeled non-Independent Identically Distributed (non-IID) data provided by multiple clients while preserving their privacy. Most existing attempts learn cluster distributions at local clients, and then securely pass the desensitized information to the server for aggregation. However, some tricky but common FC problems are still relatively unexplored, including the heterogeneity in terms of clients' communication capacity and the unknown number of proper clusters $k^*$. To further bridge the gap between FC and real application scenarios, this paper first shows that the clients' communication asynchrony and unknown $k^*$ are complex coupling problems, and then proposes an Asynchronous Federated Cluster Learning (AFCL) method accordingly. It spreads the excessive number of seed points to the clients as a learning medium and coordinates them across the clients to form a consensus. To alleviate the distribution imbalance cumulated due to the unforeseen asynchronous uploading from the heterogeneous clients, we also design a balancing mechanism for seeds updating. As a result, the seeds gradually adapt to each other to reveal a proper number of clusters. Extensive experiments demonstrate the efficacy of AFCL.

Yunfan Zhang, Yiqun Zhang, Yang Lu, Mengke Li, Xi Chen, Yiu-ming Cheung• 2024

Related benchmarks

TaskDatasetResultRank
Image ClusteringCIFAR-10--
243
ClusteringFMNIST--
31
ClusteringMNIST
ARI0.038
19
ClusteringEP
Purity0.341
18
ClusteringEC
Purity0.471
18
ClusteringVE
Purity29.8
18
ClusteringEMNIST
ARI8.9
17
ClusteringYE
ARI0.003
16
Global ClusteringEC
SC0.273
9
Federated ClusteringEC
NMI0.053
9
Showing 10 of 67 rows

Other info

Follow for update