NodeMixup: Tackling Under-Reaching for Graph Neural Networks

About

Graph Neural Networks (GNNs) have become mainstream methods for solving the semi-supervised node classification problem. However, because labeled nodes are unevenly located in the graph, they are accessible to only a small portion of unlabeled nodes, leading to the under-reaching issue. In this study, we first reveal under-reaching through an empirical investigation on various well-known graphs. Then, through systematic experimental analysis, we demonstrate that under-reaching results in unsatisfactory distribution alignment between labeled and unlabeled nodes, significantly degrading GNNs' performance. To tackle under-reaching for GNNs, we propose an architecture-agnostic method dubbed NodeMixup. The fundamental idea is to (1) increase the reachability of labeled nodes through mixup of labeled-unlabeled pairs, (2) leverage graph structures by fusing the neighbor connections of intra-class node pairs to improve the performance gains of mixup, and (3) use neighbor label distribution similarity, incorporating node degrees, to determine sampling weights for node mixup. Extensive experiments demonstrate the efficacy of NodeMixup in assisting GNNs in handling under-reaching. The source code is available at https://github.com/WeigangLu/NodeMixup.

Weigang Lu, Ziyu Guan, Wei Zhao, Yaming Yang, Long Jin • 2023
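
Point (1) of the abstract, mixup of labeled-unlabeled pairs, can be illustrated with a minimal sketch. This is generic mixup on node features, not the authors' implementation (see the linked repository for that). The function name, the uniform random pairing, the Beta(alpha, alpha) mixing coefficient, and the use of soft pseudo-labels for unlabeled nodes are all assumptions made for illustration. In particular, the sketch does not implement the neighbor-connection fusion or the similarity-based sampling weights described in points (2) and (3).

```python
import numpy as np


def mixup_labeled_unlabeled(x_l, y_l, x_u, p_u, alpha=1.0, rng=None):
    """Mix each labeled node with a randomly chosen unlabeled node.

    Hypothetical helper for illustration only.

    x_l: (n_l, d) features of labeled nodes
    y_l: (n_l, c) one-hot labels of labeled nodes
    x_u: (n_u, d) features of unlabeled nodes
    p_u: (n_u, c) soft pseudo-labels for unlabeled nodes
         (assumed to come from a model's current predictions)
    """
    rng = np.random.default_rng() if rng is None else rng

    # One mixing coefficient per call, drawn from Beta(alpha, alpha)
    # as in standard mixup (an assumption here).
    lam = rng.beta(alpha, alpha)

    # Uniform pairing: pick one unlabeled partner per labeled node.
    # The paper instead describes similarity-based sampling weights.
    idx = rng.integers(0, x_u.shape[0], size=x_l.shape[0])

    # Convex combination of features and of label distributions.
    x_mix = lam * x_l + (1.0 - lam) * x_u[idx]
    y_mix = lam * y_l + (1.0 - lam) * p_u[idx]
    return x_mix, y_mix, lam
```

Because both inputs to the label combination are probability distributions, each mixed label row remains a valid distribution and can be used directly as a soft target in a cross-entropy loss.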

Related benchmarks

Task | Dataset | Result | Rank
Node Classification | Cora (semi-supervised) | Accuracy 83.54 | 103
Node Classification | Cite semi-supervised | Accuracy 75.12 | 61
Node Classification | PubMed semi-supervised | Accuracy 81.26 | 42
Node Classification | Physics semi-supervised | Accuracy 94.76 | 30
Node Classification | CS semi-supervised | Accuracy 92.82 | 30
Node Classification | CORA inductive setting (test) | Accuracy 83.1 | 22
Node Classification | CITESEER inductive setting (test) | Accuracy 73.1 | 21
Semi-supervised node classification | Ogbn-arxiv | Accuracy 0.7005 | 20
Node Classification | Flickr semi-supervised 5% training size | Accuracy 50.81 | 15
Node Classification | ogbn-arxiv semi-supervised 1% training size | Accuracy 64.26 | 15
Showing 10 of 13 rows
