Graph Representation Learning Beyond Node and Homophily
About
Unsupervised graph representation learning aims to distill various graph information into a downstream task-agnostic dense vector embedding. However, existing graph representation learning approaches are designed mainly under the node homophily assumption: connected nodes tend to have similar labels and optimize performance on node-centric downstream tasks. Their design is apparently against the task-agnostic principle and generally suffers poor performance in tasks, e.g., edge classification, that demands feature signals beyond the node-view and homophily assumption. To condense different feature signals into the embeddings, this paper proposes PairE, a novel unsupervised graph embedding method using two paired nodes as the basic unit of embedding to retain the high-frequency signals between nodes to support node-related and edge-related tasks. Accordingly, a multi-self-supervised autoencoder is designed to fulfill two pretext tasks: one retains the high-frequency signal better, and another enhances the representation of commonality. Our extensive experiments on a diversity of benchmark datasets clearly show that PairE outperforms the unsupervised state-of-the-art baselines, with up to 101.1\% relative improvement on the edge classification tasks that rely on both the high and low-frequency signals in the pair and up to 82.5\% relative performance gain on the node classification tasks.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Node Classification | Cora (test) | -- | 687 | |
| Node Classification | PPI (test) | F1 (micro)0.9483 | 126 | |
| Node Embedding Learning | PPI | Time (s)167.4 | 20 | |
| Multi-Label Classification | Flickr 30% (train) | Micro-F183.2 | 18 | |
| Node Classification | DBLP* multi-label (test) | Micro-F180.58 | 13 | |
| Node Classification | Cornell standard (test) | Micro-F166.73 | 13 | |
| Edge classification | WN18RR+ 30% ratio Multi-label edges (train) | Micro-F174.99 | 12 | |
| Node Classification | Cuneiform multi-label (test) | Micro-F175.12 | 12 | |
| Edge classification | Cuneiform (30% train ratio) | Micro-F195.71 | 12 | |
| Edge classification | FB15K237+ Multi-label edges (30% train ratio) | Micro-F189.36 | 7 |