Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

FedClust: Tackling Data Heterogeneity in Federated Learning through Weight-Driven Client Clustering

About

Federated learning (FL) is an emerging distributed machine learning paradigm that enables collaborative training of machine learning models over decentralized devices without exposing their local data. One of the major challenges in FL is the presence of uneven data distributions across client devices, violating the well-known assumption of independent-and-identically-distributed (IID) training samples in conventional machine learning. To address the performance degradation issue incurred by such data heterogeneity, clustered federated learning (CFL) shows its promise by grouping clients into separate learning clusters based on the similarity of their local data distributions. However, state-of-the-art CFL approaches require a large number of communication rounds to learn the distribution similarities during training until the formation of clusters is stabilized. Moreover, some of these algorithms heavily rely on a predefined number of clusters, thus limiting their flexibility and adaptability. In this paper, we propose {\em FedClust}, a novel approach for CFL that leverages the correlation between local model weights and the data distribution of clients. {\em FedClust} groups clients into clusters in a one-shot manner by measuring the similarity degrees among clients based on the strategically selected partial weights of locally trained models. We conduct extensive experiments on four benchmark datasets with different non-IID data settings. Experimental results demonstrate that {\em FedClust} achieves higher model accuracy up to $\sim$45\% as well as faster convergence with a significantly reduced communication cost up to 2.7$\times$ compared to its state-of-the-art counterparts.

Md Sirajul Islam, Simin Javaherian, Fei Xu, Xu Yuan, Li Chen, Nian-Feng Tzeng• 2024

Related benchmarks

TaskDatasetResultRank
Image ClassificationFMNIST--
33
Federated Image ClassificationSVHN Dirichlet alpha=0.5 (test)
Global Accuracy79.66
7
Federated Image ClassificationFMNIST Dirichlet alpha=0.5 (test)
Global Accuracy85.19
7
Image ClassificationSVHN Dirichlet alpha=0.3 (test)
Global Accuracy78.96
7
Image ClassificationFMNIST Dirichlet alpha=0.3 (test)
Global Accuracy78.73
7
Image ClassificationCIFAR-10 Dirichlet alpha=0.3 (test)
Global Accuracy32.57
7
Federated Image ClassificationCIFAR-100 Dirichlet alpha=0.5 (test)
Global Accuracy5.57
7
Image ClassificationSVHN Dirichlet alpha=0.1
Global Accuracy44.54
7
Image ClassificationCIFAR-100 Dirichlet alpha=0.1
Global Accuracy4.96
7
Federated Image ClassificationCIFAR-10 Dirichlet alpha=0.5 (test)
Global Accuracy22.51
7
Showing 10 of 14 rows

Other info

Follow for update