Learning How Much to Think: Difficulty-Aware Dynamic MoEs for Graph Node Classification

About

Mixture-of-Experts (MoE) architectures offer a scalable path for Graph Neural Networks (GNNs) in node classification tasks but typically rely on static and rigid routing strategies that enforce a uniform expert budget or coarse-grained expert toggles on all nodes. This limitation overlooks the varying discriminative difficulty of nodes and leads to under-fitting for hard nodes and redundant computation for easy ones. To resolve this issue, we propose D2MoE, a novel framework that shifts the focus from static expert selection to node-wise expert resource allocation. By using predictive entropy as a real-time proxy for difficulty, D2MoE employs a difficulty-driven top-p routing mechanism to adaptively concentrate expert resources on hard nodes while reducing overhead for easy ones, achieving continuous and fine-grained expert budget scaling for node classification. Experiments on 13 benchmarks demonstrate that D2MoE achieves consistent state-of-the-art performance, surpassing leading baselines by up to 7.92% in accuracy on heterophilous graphs. Notably, on large-scale graphs, it reduces memory consumption by up to 73.07% and training time by 46.53% compared to the best-performing Graph MoE, thereby validating its superior efficiency.

Jiajun Zhou, Yadong Li, Xuanze Chen, Chen Ma, Chuang Zhao, Shanqing Yu, Qi Xuan• 2026

Related benchmarks

Task	Dataset	Result
Node Classification	ogbn-arxiv (test)	Accuracy71.72	497
Node Classification	Chameleon (test)	Mean Accuracy50.39	335
Node Classification	Actor (test)	Mean Accuracy0.3826	286
Node Classification	Wiki-CS (test)	Accuracy85.45	146
Node Classification	Photo (test)	Mean Accuracy95.79	125
Node Classification	Coauthor-CS (test)	Accuracy95.82	120
Node Classification	Computers (test)	Mean Accuracy92.23	109
Node Classification	penn94 (test)	Accuracy84.55	48
Node Classification	Squirrel fix (test)	Test Accuracy44.11	23
Node Classification	Facebook (test)	Test Accuracy95.47	22

Showing 10 of 10 rows

Other info

Follow for update

@wizwand_team Discord