Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Asynchronous Federated Clustering with Unknown Number of Clusters

About

Federated Clustering (FC) is crucial to mining knowledge from unlabeled non-Independent Identically Distributed (non-IID) data provided by multiple clients while preserving their privacy. Most existing attempts learn cluster distributions at local clients, and then securely pass the desensitized information to the server for aggregation. However, some tricky but common FC problems are still relatively unexplored, including the heterogeneity in terms of clients' communication capacity and the unknown number of proper clusters $k^*$. To further bridge the gap between FC and real application scenarios, this paper first shows that the clients' communication asynchrony and unknown $k^*$ are complex coupling problems, and then proposes an Asynchronous Federated Cluster Learning (AFCL) method accordingly. It spreads the excessive number of seed points to the clients as a learning medium and coordinates them across the clients to form a consensus. To alleviate the distribution imbalance cumulated due to the unforeseen asynchronous uploading from the heterogeneous clients, we also design a balancing mechanism for seeds updating. As a result, the seeds gradually adapt to each other to reveal a proper number of clusters. Extensive experiments demonstrate the efficacy of AFCL.

Yunfan Zhang, Yiqun Zhang, Yang Lu, Mengke Li, Xi Chen, Yiu-ming Cheung• 2024

Related benchmarks

TaskDatasetResultRank
Image ClusteringCIFAR-10--
318
ClusteringFMNIST--
31
ClusteringMNIST
ARI0.038
19
ClusteringEP
Purity0.341
18
ClusteringEC
Purity0.471
18
ClusteringMNIST
Running Time9.28e+3
18
ClusteringVE
Purity29.8
18
ClusteringEMNIST
ARI8.9
17
ClusteringWiki
Clustering Time (s)293.5
16
ClusteringYE
ARI0.003
16
Showing 10 of 75 rows
...

Other info

Follow for update