Topology-Imbalance Learning for Semi-Supervised Node Classification

About

The class imbalance problem, an important issue in learning node representations, has drawn increasing attention from the community. Although the imbalance considered by existing studies stems from the unequal quantity of labeled examples in different classes (quantity imbalance), we argue that graph data expose a unique source of imbalance in the asymmetric topological properties of the labeled nodes, i.e., labeled nodes are not equal in terms of their structural role in the graph (topology imbalance). In this work, we first probe the previously unknown topology-imbalance issue, including its characteristics, causes, and threats to semi-supervised node classification learning. We then provide a unified view for jointly analyzing the quantity- and topology-imbalance issues by considering the node influence shift phenomenon with the Label Propagation algorithm. In light of our analysis, we devise an influence conflict detection-based metric, Totoro, to measure the degree of graph topology imbalance, and propose a model-agnostic method, ReNode, to address the topology-imbalance issue by re-weighting the influence of labeled nodes adaptively based on their relative positions to class boundaries. Systematic experiments demonstrate the effectiveness and generalizability of our method in relieving the topology-imbalance issue and promoting semi-supervised node classification. Further analysis unveils the varied sensitivity of different graph neural networks (GNNs) to topology imbalance, which may serve as a new perspective in evaluating GNN architectures.
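The re-weighting idea described above can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not specify how influence or conflict is computed, so the code below assumes personalized PageRank as the influence measure, an overlap-based conflict score, and a cosine schedule over the conflict ranking. All function names, the teleport probability `alpha`, and the weight bounds `w_min` and `w_max` are hypothetical choices made for illustration.

```python
import math

def personalized_pagerank(adj, seed, alpha=0.15, iters=200):
    """Power-iteration personalized PageRank from one seed node.

    adj is a list of neighbor lists (undirected graph). alpha is an
    assumed teleport probability, not a value taken from the paper.
    """
    n = len(adj)
    p = [0.0] * n
    p[seed] = 1.0
    for _ in range(iters):
        nxt = [0.0] * n
        nxt[seed] = alpha  # teleport mass returns to the seed
        for u, nbrs in enumerate(adj):
            mass = (1.0 - alpha) * p[u]
            if nbrs:
                share = mass / len(nbrs)
                for v in nbrs:
                    nxt[v] += share
            else:
                nxt[seed] += mass  # isolated node: send its mass back to the seed
        p = nxt
    return p

def conflict_scores(adj, labels, alpha=0.15):
    """Assumed conflict score: overlap between a labeled node's influence
    and the class-averaged influence of labeled nodes from other classes."""
    ppr = {v: personalized_pagerank(adj, v, alpha) for v in labels}
    class_size = {}
    for c in labels.values():
        class_size[c] = class_size.get(c, 0) + 1
    n = len(adj)
    scores = {}
    for v, y in labels.items():
        other = [0.0] * n
        for u, c in labels.items():
            if c != y:
                for x in range(n):
                    other[x] += ppr[u][x] / class_size[c]
        scores[v] = sum(ppr[v][x] * other[x] for x in range(n))
    return scores

def rank_based_weights(scores, w_min=0.5, w_max=1.5):
    """Cosine schedule over the conflict ranking (an assumption):
    the least conflicted node gets w_max, the most conflicted gets w_min."""
    order = sorted(scores, key=scores.get)
    denom = max(len(order) - 1, 1)
    return {
        v: w_min + 0.5 * (w_max - w_min) * (1.0 + math.cos(math.pi * r / denom))
        for r, v in enumerate(order)
    }

# Toy path graph 0-1-2-3-4-5; nodes 0 and 2 are labeled class 0, node 5 class 1.
adj = [[1], [0, 2], [1, 3], [2, 4], [3, 5], [4]]
labels = {0: 0, 2: 0, 5: 1}
weights = rank_based_weights(conflict_scores(adj, labels))
```

In this toy graph, node 2 sits closest to the other class and so receives the smallest weight, while node 0, deep inside its own class, receives the largest. In a training loop, such weights would plausibly scale each labeled node's loss term, which is what makes the approach independent of the GNN architecture.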

Deli Chen, Yankai Lin, Guangxiang Zhao, Xuancheng Ren, Peng Li, Jie Zhou, Xu Sun • 2021

Related benchmarks

| Task | Dataset | Result | Rank |
|---|---|---|---|
| Node Classification | Cora (semi-supervised) | -- | 103 |
| Node Classification | CS-Random (test) | Balanced Accuracy: 87.53 | 72 |
| Node Classification | CS ρ ≈ 41.0 (random) | Balanced Accuracy: 87.53 | 54 |
| Node Classification | Computers-Random (ρ ≈ 17.7) | Balanced Accuracy: 81.64 | 54 |
| Node Classification | CS Random | F1-score: 88.91 | 51 |
| Node Classification | Flickr | bAcc: 30.66 | 48 |
| Node Classification | Computers Random (test) | Balanced Accuracy (bAcc): 81.64 | 39 |
| Node Classification | Computers Random rho=25.50 (test) | Balanced Accuracy: 81.64 | 33 |
| Node Classification | Citeseer semi-supervised (test) | Accuracy: 43.47 | 26 |
| Node Classification | Ogbn-arxiv naturally imbalanced setting (ρ ≈ 775.4) (test) | bAcc: 51.12 | 24 |
Showing 10 of 17 rows
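Most results above are reported as balanced accuracy (bAcc), which is the mean of per-class recall and therefore does not reward a classifier for simply predicting the majority class. The sketch below shows the computation in plain Python. The `imbalance_ratio` helper assumes that ρ denotes the ratio of the largest to the smallest class size, a common convention that this page does not itself define.

```python
from collections import Counter

def balanced_accuracy(y_true, y_pred):
    """Mean of per-class recall over the classes present in y_true."""
    total = Counter(y_true)
    correct = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    return sum(correct[c] / total[c] for c in total) / len(total)

def imbalance_ratio(y):
    """Assumed definition of rho: largest class size / smallest class size."""
    counts = Counter(y)
    return max(counts.values()) / min(counts.values())

# A majority-class predictor on an 8:2 split scores 0.8 plain accuracy
# but only 0.5 balanced accuracy.
y_true = [0] * 8 + [1] * 2
y_pred = [0] * 10
bacc = balanced_accuracy(y_true, y_pred)
rho = imbalance_ratio(y_true)
```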
