Learning How Much to Think: Difficulty-Aware Dynamic MoEs for Graph Node Classification
About
Mixture-of-Experts (MoE) architectures offer a scalable path for Graph Neural Networks (GNNs) in node classification tasks but typically rely on static and rigid routing strategies that enforce a uniform expert budget or coarse-grained expert toggles on all nodes. This limitation overlooks the varying discriminative difficulty of nodes and leads to under-fitting for hard nodes and redundant computation for easy ones. To resolve this issue, we propose D2MoE, a novel framework that shifts the focus from static expert selection to node-wise expert resource allocation. By using predictive entropy as a real-time proxy for difficulty, D2MoE employs a difficulty-driven top-p routing mechanism to adaptively concentrate expert resources on hard nodes while reducing overhead for easy ones, achieving continuous and fine-grained expert budget scaling for node classification. Experiments on 13 benchmarks demonstrate that D2MoE achieves consistent state-of-the-art performance, surpassing leading baselines by up to 7.92% in accuracy on heterophilous graphs. Notably, on large-scale graphs, it reduces memory consumption by up to 73.07% and training time by 46.53% compared to the best-performing Graph MoE, thereby validating its superior efficiency.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Node Classification | ogbn-arxiv (test) | Accuracy71.72 | 433 | |
| Node Classification | Chameleon (test) | Mean Accuracy50.39 | 297 | |
| Node Classification | Actor (test) | Mean Accuracy0.3826 | 237 | |
| Node Classification | Photo (test) | Mean Accuracy95.79 | 92 | |
| Node Classification | Computers (test) | Mean Accuracy92.23 | 91 | |
| Node Classification | Coauthor-CS (test) | Accuracy95.82 | 83 | |
| Node Classification | Wiki-CS (test) | Accuracy85.45 | 75 | |
| Node Classification | penn94 (test) | Accuracy84.55 | 39 | |
| Node Classification | Squirrel fix (test) | Test Accuracy44.11 | 23 | |
| Node Classification | Facebook (test) | Test Accuracy95.47 | 22 |