Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Graph Representation Learning Beyond Node and Homophily

About

Unsupervised graph representation learning aims to distill various graph information into a downstream task-agnostic dense vector embedding. However, existing graph representation learning approaches are designed mainly under the node homophily assumption: connected nodes tend to have similar labels and optimize performance on node-centric downstream tasks. Their design is apparently against the task-agnostic principle and generally suffers poor performance in tasks, e.g., edge classification, that demands feature signals beyond the node-view and homophily assumption. To condense different feature signals into the embeddings, this paper proposes PairE, a novel unsupervised graph embedding method using two paired nodes as the basic unit of embedding to retain the high-frequency signals between nodes to support node-related and edge-related tasks. Accordingly, a multi-self-supervised autoencoder is designed to fulfill two pretext tasks: one retains the high-frequency signal better, and another enhances the representation of commonality. Our extensive experiments on a diversity of benchmark datasets clearly show that PairE outperforms the unsupervised state-of-the-art baselines, with up to 101.1\% relative improvement on the edge classification tasks that rely on both the high and low-frequency signals in the pair and up to 82.5\% relative performance gain on the node classification tasks.

You Li, Bei Lin, Binli Luo, Ning Gui• 2022

Related benchmarks

TaskDatasetResultRank
Node ClassificationCora (test)--
687
Node ClassificationPPI (test)
F1 (micro)0.9483
126
Node Embedding LearningPPI
Time (s)167.4
20
Multi-Label ClassificationFlickr 30% (train)
Micro-F183.2
18
Node ClassificationDBLP* multi-label (test)
Micro-F180.58
13
Node ClassificationCornell standard (test)
Micro-F166.73
13
Edge classificationWN18RR+ 30% ratio Multi-label edges (train)
Micro-F174.99
12
Node ClassificationCuneiform multi-label (test)
Micro-F175.12
12
Edge classificationCuneiform (30% train ratio)
Micro-F195.71
12
Edge classificationFB15K237+ Multi-label edges (30% train ratio)
Micro-F189.36
7
Showing 10 of 13 rows

Other info

Code

Follow for update