Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Connecting Low-Loss Subspace for Personalized Federated Learning

About

Due to the curse of statistical heterogeneity across clients, adopting a personalized federated learning method has become an essential choice for the successful deployment of federated learning-based services. Among diverse branches of personalization techniques, a model mixture-based personalization method is preferred as each client has their own personalized model as a result of federated learning. It usually requires a local model and a federated model, but this approach is either limited to partial parameter exchange or requires additional local updates, each of which is helpless to novel clients and burdensome to the client's computational capacity. As the existence of a connected subspace containing diverse low-loss solutions between two or more independent deep networks has been discovered, we combined this interesting property with the model mixture-based personalized federated learning method for improved performance of personalization. We proposed SuPerFed, a personalized federated learning method that induces an explicit connection between the optima of the local and the federated model in weight space for boosting each other. Through extensive experiments on several benchmark datasets, we demonstrated that our method achieves consistent gains in both personalization performance and robustness to problematic scenarios possible in realistic services.

Seok-Ju Hahn, Minwoo Jeong, Junghye Lee• 2021

Related benchmarks

TaskDatasetResultRank
Multi-class classificationCIFAR100 Dirichlet non-IID setting (test)
Top-5 Accuracy62.5
33
Multi-class classificationTinyImageNet Dirichlet non-IID setting (test)
Top-5 Acc52.9
33
Image ClassificationMNIST pair noise 0.1 Dirichlet non-IID (a=100) (test)
ECE14
11
Image ClassificationMNIST pair noise 0.4 Dirichlet non-IID (a=100) (test)
ECE0.28
11
Image ClassificationMNIST symmetric noise 0.2 Dirichlet non-IID (a=100) (test)
ECE27
11
Image ClassificationMNIST symmetric noise 0.6 Dirichlet non-IID (a=100) (test)
ECE30
11
Image ClassificationCIFAR10 pair noise 0.1 Dirichlet non-IID (a=100) (test)
ECE0.28
11
Image ClassificationCIFAR10 pair noise 0.4 Dirichlet non-IID (a=100) (test)
ECE0.27
11
Image ClassificationCIFAR10 symmetric noise 0.2 Dirichlet non-IID (a=100) (test)
ECE31
11
Image ClassificationCIFAR10 symmetric noise 0.6 Dirichlet non-IID (a=100) (test)
ECE28
11
Showing 10 of 18 rows

Other info

Follow for update