FedCoE: Bridging Generalization and Personalization via Federated Coordinated Dual-level MoEs
About
Federated Learning (FL) has emerged as a promising paradigm for privacy-preserving distributed learning. However, existing FL methods face a fundamental challenge. Traditional averaging-based approaches suffer from parameter divergence under non-IID conditions, while personalized FL methods overfit to local data and fail to generalize to new clients (cold-start problem). Mixture-of-Experts naturally addresses this by routing heterogeneous data to specialized experts rather than forcing uniform aggregation. In this paper, we propose FedCoE, a Federated Coordinated dual-level mixture-of-Experts framework that effectively balances global generalization with local personalization. FedCoE maintains multiple independent global expert models on the server and employs a shared gating network to dynamically model client-expert correlations during aggregation, effectively mitigating expert drift and gating inconsistency. To address the cold-start challenge, we introduce an adaptive mechanism that enables new clients to immediately leverage the global expert pool without extensive local training. Extensive experiments demonstrate that FedCoE achieves 78.00% global accuracy and 89.32% personalized accuracy on average, outperforming the baseline by 8.82% and 29.19%, respectively. In cold-start scenarios, FedCoE delivers 77.27% accuracy without any local fine-tuning, outperforming baselines by over 12.54%.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Classification | CIFAR-10 Dirichlet alpha=0.1 | Global Accuracy63.35 | 17 | |
| Federated Image Classification | CIFAR-10 Dirichlet alpha=0.5 (test) | Local Accuracy77.27 | 11 | |
| Image Classification | CIFAR-10 | Personalized Accuracy87.61 | 7 | |
| Image Classification | CIFAR-100 | Personalized Accuracy65.65 | 7 | |
| Image Classification | Tiny-ImageNet | Personalized Accuracy52.6 | 7 | |
| Image Classification | CIFAR-10 IID | Per-class Accuracy84.48 | 7 | |
| Image Classification | CIFAR-10 Dirichlet (alpha=1) | Local Accuracy85.86 | 7 | |
| Image Classification | CIFAR-10 Dirichlet (alpha=0.5) | Local Accuracy (PAcc)90.26 | 7 | |
| Federated Image Classification | CIFAR-10 alpha=0.1, Cold-Start (test) | Client 1 Accuracy89.8 | 4 |