FLEX-MoE: Federated Mixture-of-Experts with Load-balanced Expert Assignment for Edge Computing
About
Mixture-of-Experts (MoE) models enable scalable neural networks through conditional computation, offering enhanced effectiveness and efficiency for next-generation wireless communications. However, deploying MoE with federated learning (FL) over wireless and IoT edge networks faces two critical challenges: 1) resource-constrained clients cannot store large AI models with full expert sets, and 2) non-IID data distributions cause severe expert load imbalance that degrades model performance. To this end, we propose FLEX-MoE, a federated MoE framework that jointly optimizes expert assignment and load balancing under limited client capacity. Specifically, our approach introduces client-expert fitness scores that quantify expert suitability for local datasets through training feedback, and employs an optimization-based algorithm to maximize client-expert specialization while enforcing balanced expert utilization system-wide. Unlike greedy methods that focus solely on personalization while ignoring load imbalance, FLEX-MoE addresses expert utilization skew, which is particularly severe in heterogeneous edge FL. Our experimental results demonstrate superior accuracy and consistently balanced expert utilization across diverse resource-constrained scenarios for edge computing.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Classification | CIFAR10 0.1-Dirichlet (test) | -- | 38 | |
| Image Classification | CIFAR10 Non-IID (4-Class partition) | Accuracy65.51 | 19 | |
| Image Classification | EMNIST (Non-IID (4-Class partition)) | Accuracy82.57 | 12 | |
| Image Classification | CIFAR10 IID | Accuracy83 | 5 | |
| Image Classification | EMNIST (IID) | Accuracy0.9833 | 5 | |
| Image Classification | GTSRB (IID) | Accuracy (%)98 | 5 | |
| Image Classification | CIFAR10 Non-IID (Dirichlet alpha = 0.8) | Accuracy0.6881 | 5 | |
| Image Classification | EMNIST Non-IID (Dirichlet alpha = 0.8) | Accuracy86.91 | 5 | |
| Image Classification | GTSRB Non-IID (Dirichlet alpha = 0.8) | Accuracy76.5 | 5 | |
| Image Classification | CIFAR10 Non-IID 2-Class partition (test) | Accuracy61 | 5 |