Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Distance-Restricted Folklore Weisfeiler-Leman GNNs with Provable Cycle Counting Power

About

The ability of graph neural networks (GNNs) to count certain graph substructures, especially cycles, is important for the success of GNNs on a wide range of tasks. It has been recently used as a popular metric for evaluating the expressive power of GNNs. Many of the proposed GNN models with provable cycle counting power are based on subgraph GNNs, i.e., extracting a bag of subgraphs from the input graph, generating representations for each subgraph, and using them to augment the representation of the input graph. However, those methods require heavy preprocessing, and suffer from high time and memory costs. In this paper, we overcome the aforementioned limitations of subgraph GNNs by proposing a novel class of GNNs -- $d$-Distance-Restricted FWL(2) GNNs, or $d$-DRFWL(2) GNNs. $d$-DRFWL(2) GNNs use node pairs whose mutual distances are at most $d$ as the units for message passing to balance the expressive power and complexity. By performing message passing among distance-restricted node pairs in the original graph, $d$-DRFWL(2) GNNs avoid the expensive subgraph extraction operations in subgraph GNNs, making both the time and space complexity lower. We theoretically show that the discriminative power of $d$-DRFWL(2) GNNs strictly increases as $d$ increases. More importantly, $d$-DRFWL(2) GNNs have provably strong cycle counting power even with $d=2$: they can count all 3, 4, 5, 6-cycles. Since 6-cycles (e.g., benzene rings) are ubiquitous in organic molecules, being able to detect and count them is crucial for achieving robust and generalizable performance on molecular tasks. Experiments on both synthetic datasets and molecular datasets verify our theory. To the best of our knowledge, our model is the most efficient GNN model to date (both theoretically and empirically) that can count up to 6-cycles.

Junru Zhou, Jiarui Feng, Xiyuan Wang, Muhan Zhang• 2023

Related benchmarks

TaskDatasetResultRank
Graph Classificationogbg-molpcba (test)
AP25.38
206
Molecular property predictionQM9 (test)
mu0.346
174
Graph RegressionZINC 12K (test)
MAE0.077
164
Molecular property predictionQM9
Cv0.0901
70
Multi-label Graph ClassificationPEPTIDES-FUNC Long-Range Graph Benchmark (test)
AP59.53
15
Graph RegressionZINC 250K (test)
MAE0.025
13
Graph DistinguishabilityBREC 1.0 (test)
Basic Score60
10
Multi-label regressionPeptides-struct LRGB 1.0 (test)
MAE0.2594
9
Binary Graph ClassificationEXP (10-fold cross val)
Accuracy100
8
Graph ClassificationSR25
Accuracy6.67
8
Showing 10 of 13 rows

Other info

Code

Follow for update