Graph Coarsening via Convolution Matching for Scalable Graph Neural Network Training
About
Graph summarization as a preprocessing step is an effective and complementary technique for scalable graph neural network (GNN) training. In this work, we propose the Coarsening Via Convolution Matching (CONVMATCH) algorithm and a highly scalable variant, A-CONVMATCH, for creating summarized graphs that preserve the output of graph convolution. We evaluate CONVMATCH on six real-world link prediction and node classification graph datasets, and show it is efficient and preserves prediction performance while significantly reducing the graph size. Notably, CONVMATCH achieves up to 95% of the prediction performance of GNNs on node classification while trained on graphs summarized down to 1% the size of the original graph. Furthermore, on link prediction tasks, CONVMATCH consistently outperforms all baselines, achieving up to a 2x improvement.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Node Classification | Citeseer | Macro-F10.6221 | 59 | |
| Node Classification | Ogbn-arxiv | Overall F138.18 | 40 | |
| Node Classification | Books | Accuracy91.62 | 21 | |
| Graph Coarsening | Citeseer | Running Time (s)4.09 | 17 | |
| Node Classification | Products | Accuracy0.6755 | 13 | |
| Graph Coarsening | Products subset | Memory Usage (MB)1.23e+3 | 4 | |
| Graph Coarsening | OGB-arxiv | Memory Usage (MB)6.53e+3 | 4 | |
| Graph Coarsening | Book | Memory Usage (MB)2.74e+4 | 3 |