
FLEX-MoE: Federated Mixture-of-Experts with Load-balanced Expert Assignment

About

Mixture-of-Experts (MoE) models enable scalable neural networks through conditional computation. However, deploying them with federated learning (FL) faces two critical challenges: (1) resource-constrained edge devices cannot store the full expert set, and (2) non-IID data distributions cause severe expert load imbalance that degrades model performance. To address these challenges, we propose FLEX-MoE, a novel federated MoE framework that jointly optimizes expert assignment and load balancing under limited client capacity. Specifically, our approach introduces client-expert fitness scores that quantify each expert's suitability for a client's local dataset based on training feedback, and employs an optimization-based algorithm that maximizes client-expert specialization while enforcing balanced expert utilization system-wide. Unlike existing greedy methods that focus solely on personalization and ignore load imbalance, FLEX-MoE directly addresses the expert utilization skew that is particularly severe in FL settings with heterogeneous data. Comprehensive experiments on three datasets demonstrate the superior performance of FLEX-MoE and its ability to maintain balanced expert utilization across diverse resource-constrained scenarios.
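To make the constraints concrete, the sketch below illustrates capacity-limited, load-balanced client-expert assignment. Note this is a simple greedy stand-in written for illustration only: the paper describes an optimization-based algorithm (explicitly contrasted with greedy baselines), and the fitness matrix, `capacity`, and `load_cap` names are assumptions, not the paper's API.

```python
import numpy as np

def assign_experts(fitness, capacity, load_cap):
    """Illustrative greedy assignment (NOT the paper's optimization-based
    method): give each client up to `capacity` experts while no expert
    serves more than `load_cap` clients, preferring high-fitness pairs.

    fitness  : (n_clients, n_experts) client-expert fitness scores
    capacity : number of experts each client can store locally
    load_cap : max clients per expert (the load-balance constraint)
    """
    n_clients, n_experts = fitness.shape
    load = np.zeros(n_experts, dtype=int)
    assignment = [[] for _ in range(n_clients)]
    # Visit (client, expert) pairs in decreasing fitness order.
    flat = np.argsort(fitness, axis=None)[::-1]
    order = np.column_stack(np.unravel_index(flat, fitness.shape))
    for c, e in order:
        if len(assignment[c]) < capacity and load[e] < load_cap:
            assignment[c].append(int(e))
            load[e] += 1
    return assignment, load
```

With a skewed fitness matrix where every client prefers expert 0, the load cap forces later clients onto their second-best expert, which is exactly the utilization-skew behavior the framework is designed to control.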

Boyang Zhang, Xiaobing Chen, Songyang Zhang, Shuai Zhang, Xiangwei Zhou, Mingxuan Sun • 2025

Related benchmarks

Task                 | Dataset                                    | Metric       | Result | Rank
Image Classification | CIFAR10 0.1-Dirichlet (test)               | --           | --     | 38
Image Classification | EMNIST Non-IID (4-Class partition)         | Accuracy (%) | 82.57  | 12
Image Classification | CIFAR10 IID                                | Accuracy (%) | 83     | 5
Image Classification | EMNIST (IID)                               | Accuracy (%) | 98.33  | 5
Image Classification | GTSRB (IID)                                | Accuracy (%) | 98     | 5
Image Classification | CIFAR10 Non-IID (Dirichlet alpha = 0.8)    | Accuracy (%) | 68.81  | 5
Image Classification | EMNIST Non-IID (Dirichlet alpha = 0.8)     | Accuracy (%) | 86.91  | 5
Image Classification | GTSRB Non-IID (Dirichlet alpha = 0.8)      | Accuracy (%) | 76.5   | 5
Image Classification | CIFAR10 Non-IID (4-Class partition)        | Accuracy (%) | 65.51  | 5
Image Classification | CIFAR10 Non-IID 2-Class partition (test)   | Accuracy (%) | 61     | 5

(10 of 11 benchmark rows shown.)
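Several of the benchmarks above use Dirichlet-based non-IID partitions (e.g. alpha = 0.8 or 0.1). The sketch below shows the common way such splits are generated: each class's samples are divided across clients in proportions drawn from a Dirichlet(alpha) distribution, so smaller alpha yields more skewed client label mixes. This is standard practice in the FL literature, not necessarily the paper's exact splitter; the function name and arguments are assumptions.

```python
import numpy as np

def dirichlet_partition(labels, n_clients, alpha, seed=0):
    """Split dataset indices across clients with a per-class
    Dirichlet(alpha) prior; smaller alpha -> more non-IID clients."""
    rng = np.random.default_rng(seed)
    client_idx = [[] for _ in range(n_clients)]
    for c in np.unique(labels):
        idx = np.flatnonzero(labels == c)
        rng.shuffle(idx)
        # Fraction of class-c samples handed to each client.
        props = rng.dirichlet(alpha * np.ones(n_clients))
        cuts = (np.cumsum(props)[:-1] * len(idx)).astype(int)
        for client, part in enumerate(np.split(idx, cuts)):
            client_idx[client].extend(part.tolist())
    return client_idx
```

For example, partitioning a 10-class dataset with alpha = 0.8 gives each client a distinctly skewed label histogram, whereas alpha = 100 approaches an IID split.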
