Supercharging Graph Transformers with Advective Diffusion
About
Generalization is a cornerstone of modern learning systems. For non-Euclidean data such as graphs, where topological structure plays a central role, prior studies have largely neglected how machine learning models generalize under topological shifts. This paper proposes the Advective Diffusion Transformer (AdvDIFFormer), a physics-inspired graph Transformer designed to address this challenge. The model is derived from advective diffusion equations, which describe a class of continuous message-passing processes over observed and latent topological structures. We show that AdvDIFFormer provably controls generalization error under topological shifts, a guarantee that graph diffusion models, i.e., the generalization of common graph neural networks to continuous space, cannot provide. Empirically, the model demonstrates superior performance in various predictive tasks across information networks, molecular screening, and protein interactions.
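To make the idea of message passing over both an observed and a latent topology concrete, here is a minimal NumPy sketch. It assumes the dynamics take the form dZ/dt = (C + βV − I)Z, where V is the normalized observed adjacency (the advection term) and C is a dense attention matrix standing in for the latent structure (the diffusion term). This form, the function name `advective_diffusion_step`, the unparameterized dot-product attention, and the explicit Euler integrator are all illustrative assumptions; none of them are stated on this page, and the paper's actual model and solver may differ.

```python
import numpy as np

def advective_diffusion_step(Z, A, beta=1.0, tau=0.1):
    """One explicit Euler step of the ASSUMED dynamics dZ/dt = (C + beta*V - I) Z.

    Z: (n, d) node embeddings; A: (n, n) observed adjacency matrix.
    beta and tau (step size) are hypothetical hyperparameters.
    """
    # Latent topology: row-stochastic softmax attention over all node pairs.
    scores = Z @ Z.T / np.sqrt(Z.shape[1])
    scores = scores - scores.max(axis=1, keepdims=True)  # numerical stability
    C = np.exp(scores)
    C = C / C.sum(axis=1, keepdims=True)

    # Observed topology: symmetrically normalized adjacency D^{-1/2} A D^{-1/2}.
    deg = A.sum(axis=1).astype(float)
    d_inv_sqrt = np.zeros_like(deg)
    mask = deg > 0
    d_inv_sqrt[mask] = deg[mask] ** -0.5
    V = d_inv_sqrt[:, None] * A * d_inv_sqrt[None, :]

    # Euler update: Z + tau * ((C + beta*V) Z - Z).
    return Z + tau * ((C + beta * V) @ Z - Z)

# Usage on a 4-node path graph with random 3-dimensional embeddings.
A = np.array([[0, 1, 0, 0],
              [1, 0, 1, 0],
              [0, 1, 0, 1],
              [0, 0, 1, 0]], dtype=float)
Z0 = np.random.default_rng(0).normal(size=(4, 3))
Z1 = advective_diffusion_step(Z0, A, beta=0.5, tau=0.1)
print(Z1.shape)  # (4, 3)
```

In this sketch, setting `beta=0` leaves only the attention term, which does not depend on the observed graph, while a larger `beta` gives the observed edges more weight.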
Related benchmarks
| Task | Dataset | Result (RMSE) | Rank |
|---|---|---|---|
| Opinion Forecasting | Delhi Election (test) | 12.13 | 24 |
| Opinion Dynamics Modeling | Israel-Palestine (30 T) | 26.17 | 17 |
| Opinion Dynamics Modeling | COVID-19 (30 T) | 18.72 | 17 |
| Opinion Dynamics Modeling | U.S. Election (30 T) | 46.17 | 17 |
| Opinion Dynamics Modeling | U.S. Election (60 T) | 50.73 | 17 |
| Opinion Dynamics Modeling | Delhi Election (60 T) | 13.91 | 17 |
| Opinion Dynamics Modeling | Israel-Palestine (60 T) | 42.52 | 17 |
| Opinion Dynamics Modeling | COVID-19 (60 T) | 20.89 | 17 |
| Opinion Dynamics Modeling | Syn-Consensus | 3.97 | 10 |
| Opinion Dynamics Modeling | Syn-Polarization | 5.12 | 10 |
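All results in the table are reported as RMSE (root mean squared error), where lower is better. As a reminder of how that metric is computed, here is a minimal sketch. The helper name `rmse` is hypothetical, and the scale or normalization of the benchmark targets is not stated on this page, so the values below are illustrative only.

```python
import math

def rmse(y_true, y_pred):
    """Root mean squared error between two equal-length sequences of numbers."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    squared_errors = [(t - p) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))

# Every prediction is off by exactly 2, so the RMSE is 2.
print(rmse([0.0, 0.0, 0.0, 0.0], [2.0, 2.0, 2.0, 2.0]))  # 2.0
```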