Learning Graph Foundation Models on Riemannian Graph-of-Graphs
About
Graph foundation models (GFMs), pretrained on massive graph data, have transformed graph machine learning by supporting general-purpose reasoning across diverse graph tasks and domains. Existing GFMs pretrained with fixed-hop subgraph sampling impose a fixed receptive field, causing scale mismatch on diverse tasks, which often require heterogeneous and unknown structural contexts beyond a fixed sampling scale. We propose R-GFM, a Riemannian Graph-of-Graphs (GoG) based foundation model, that treats structural scale as a first-class citizen in modeling. R-GFM constructs a multi-scale GoG over-sampled subgraphs at different hop distances and learns geometry-adaptive representations from Riemannian manifolds. Theoretical analysis shows that R-GFM reduces structural domain generalization error compared to fixed-scale GFMs. Experiments on various datasets demonstrate that R-GFM achieves state-of-the-art performance, with up to a 49% relative improvement on downstream tasks. Our code is available at https://github.com/USTC-DataDarknessLab/R-GFM.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Node Classification | Citeseer | Accuracy73.98 | 1037 | |
| Node Classification | Wisconsin | Accuracy47.75 | 864 | |
| Node Classification | Cornell | Accuracy47.97 | 851 | |
| Node Classification | Texas | Accuracy0.5164 | 801 | |
| Node Classification | Cora | Accuracy63.77 | 583 | |
| Node Classification | Pubmed | Accuracy63.39 | 363 | |
| Node Classification | Photo | Accuracy73.62 | 254 | |
| Link Prediction | Citeseer | AUC90.88 | 162 | |
| Link Prediction | Pubmed | AUC88.62 | 156 | |
| Graph Classification | HIV | ROC-AUC0.6648 | 155 |