Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Handling Distribution Shifts on Graphs: An Invariance Perspective

About

There is increasing evidence suggesting neural networks' sensitivity to distribution shifts, so that research on out-of-distribution (OOD) generalization comes into the spotlight. Nonetheless, current endeavors mostly focus on Euclidean data, and its formulation for graph-structured data is not clear and remains under-explored, given two-fold fundamental challenges: 1) the inter-connection among nodes in one graph, which induces non-IID generation of data points even under the same environment, and 2) the structural information in the input graph, which is also informative for prediction. In this paper, we formulate the OOD problem on graphs and develop a new invariant learning approach, Explore-to-Extrapolate Risk Minimization (EERM), that facilitates graph neural networks to leverage invariance principles for prediction. EERM resorts to multiple context explorers (specified as graph structure editers in our case) that are adversarially trained to maximize the variance of risks from multiple virtual environments. Such a design enables the model to extrapolate from a single observed environment which is the common case for node-level prediction. We prove the validity of our method by theoretically showing its guarantee of a valid OOD solution and further demonstrate its power on various real-world datasets for handling distribution shifts from artificial spurious features, cross-domain transfers and dynamic graph evolution.

Qitian Wu, Hengrui Zhang, Junchi Yan, David Wipf• 2022

Related benchmarks

TaskDatasetResultRank
Node ClassificationCora
Accuracy91.8
1215
Node ClassificationCiteseer
Accuracy70.08
931
Node ClassificationPubmed
Accuracy78.65
819
Node ClassificationwikiCS
Accuracy77.29
317
Node ClassificationarXiv
Accuracy58.62
219
Node ClassificationCiteseer
Mean Accuracy74.76
90
Node ClassificationProducts
Accuracy82.68
56
Node Classificationogbn-arxiv v1 (test)
Accuracy44.53
52
Node ClassificationGOODCora Covariate shift, degree (test)
Accuracy58.38
44
Node ClassificationTwitch (OOD)
AUROC66.8
36
Showing 10 of 78 rows
...

Other info

Follow for update