RADE: Random Add-Drop Edge as a Regularizer
About
Graph Neural Networks (GNNs) suffer from overfitting and over-squashing of long-range information. Stochastic graph augmentations (e.g., edge deletion) regularize training against overfitting but can introduce train-inference misalignment and do not improve over-squashing. In contrast, rewiring methods improve connectivity to mitigate over-squashing, but are not designed to regularize training. We propose Random Add-Drop Edge (RADE), a stochastic graph augmentation method that jointly drops and adds edges to address both overfitting and over-squashing simultaneously. RADE is provably designed to align training and inference so that random augmentations regularize training without distribution shift, while supporting long-range communication at inference. We further propose and study a mini-batch gradient-norm balancing algorithm that adapts deletion and addition rates during training, rendering RADE hyperparameter-free in practice. Experiments on node- and graph-classification benchmarks show that RADE is a strong regularizer and mitigates over-squashing. Ablations support the roles of train-inference alignment, adaptive rate selection, and the complementary effects of random edge deletion and edge addition.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Graph Classification | PROTEINS | Accuracy75.95 | 1252 | |
| Graph Classification | MUTAG | Accuracy86.24 | 1103 | |
| Node Classification | Cora | Accuracy80.32 | 583 | |
| Node Classification | Citeseer | Accuracy66.34 | 503 | |
| Graph Classification | IMDB-B | Accuracy76.6 | 425 | |
| Graph Classification | IMDB-M | Accuracy52.24 | 425 | |
| Node Classification | Ogbn-arxiv | Accuracy71.85 | 304 | |
| Node Classification | Physics | Accuracy96.58 | 205 | |
| Node Classification | CS | Accuracy93.01 | 175 | |
| Node Classification | Computer | Accuracy90.5 | 159 |