Implicit Regularization of Mini-Batch Training in Graph Neural Networks
About
Mini-batch training of Graph Neural Networks (GNNs) is fundamentally different from training on i.i.d. data: sampling a subgraph alters the topology and introduces boundary effects, leading prior work to develop structure-aware samplers that preserve local connectivity and reduce embedding variance. Surprisingly, we demonstrate that the simplest possible scheme, Random Node Sampling (RNS), training on the induced subgraph of uniformly sampled nodes, matches or outperforms full-graph training on 8 of 10 datasets at a fraction of the wall-clock time and memory. To explain this, we apply backward error analysis to graph mini-batch Stochastic Gradient Descent (SGD) and show that it implicitly minimizes the sampled loss plus a regularizer proportional to the mini-batch gradient variance, a quantity directly shaped by the sampler. Although RNS discards local structure, it produces mini-batches whose expected loss is closer to the full-graph loss, and whose per-batch gradients have lower variance, yielding a better implicit objective. Our analysis reframes the choice of graph sampler as a form of implicit regularization, and identifies RNS as a strong, theoretically grounded method for scalable GNN training.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Node Classification | Pokec | Accuracy83.98 | 95 | |
| Node Classification | OGBN-Products | Accuracy83.11 | 88 | |
| Binary Classification | city-reviews (RL) | Average Precision78.05 | 21 | |
| Binary Classification | tolokers 2 (RL (Random Low)) | AP54.41 | 21 | |
| Binary Classification | ARTNET-EXP RL | AP42.78 | 2 | |
| Binary Classification | TOLOKERS RH 2 | Average Precision (AP)61.34 | 2 | |
| Binary Classification | CITY-REVIEWS RH | Average Precision80.84 | 2 | |
| Binary Classification | ARTNET-EXP RH | Average Precision (AP)49.28 | 2 | |
| Multiclass Classification | POKEC-REGIONS RL | Accuracy41.25 | 2 | |
| Multiclass Classification | WEB-TOPICS RL | Accuracy47.76 | 2 |