FedCoE: Bridging Generalization and Personalization via Federated Coordinated Dual-level MoEs

About

Federated Learning (FL) has emerged as a promising paradigm for privacy-preserving distributed learning. However, existing FL methods face a fundamental challenge. Traditional averaging-based approaches suffer from parameter divergence under non-IID conditions, while personalized FL methods overfit to local data and fail to generalize to new clients (cold-start problem). Mixture-of-Experts naturally addresses this by routing heterogeneous data to specialized experts rather than forcing uniform aggregation. In this paper, we propose FedCoE, a Federated Coordinated dual-level mixture-of-Experts framework that effectively balances global generalization with local personalization. FedCoE maintains multiple independent global expert models on the server and employs a shared gating network to dynamically model client-expert correlations during aggregation, effectively mitigating expert drift and gating inconsistency. To address the cold-start challenge, we introduce an adaptive mechanism that enables new clients to immediately leverage the global expert pool without extensive local training. Extensive experiments demonstrate that FedCoE achieves 78.00% global accuracy and 89.32% personalized accuracy on average, outperforming the baseline by 8.82% and 29.19%, respectively. In cold-start scenarios, FedCoE delivers 77.27% accuracy without any local fine-tuning, outperforming baselines by over 12.54%.

Penglin Dai, Fulian Li, Xincao Xu, Junhua Wang, Lixin Duan, Xiao Wu• 2026

Related benchmarks

Task	Dataset	Result
Image Classification	CIFAR-10 Dirichlet alpha=0.1	Global Accuracy63.35	17
Federated Image Classification	CIFAR-10 Dirichlet alpha=0.5 (test)	Local Accuracy77.27	11
Image Classification	CIFAR-10	Personalized Accuracy87.61	7
Image Classification	CIFAR-100	Personalized Accuracy65.65	7
Image Classification	Tiny-ImageNet	Personalized Accuracy52.6	7
Image Classification	CIFAR-10 IID	Per-class Accuracy84.48	7
Image Classification	CIFAR-10 Dirichlet (alpha=1)	Local Accuracy85.86	7
Image Classification	CIFAR-10 Dirichlet (alpha=0.5)	Local Accuracy (PAcc)90.26	7
Federated Image Classification	CIFAR-10 alpha=0.1, Cold-Start (test)	Client 1 Accuracy89.8	4

Showing 9 of 9 rows

Other info

Follow for update

@wizwand_team Discord