Motif-Driven Contrastive Learning of Graph Representations
About
Pre-training Graph Neural Networks (GNN) via self-supervised contrastive learning has recently drawn lots of attention. However, most existing works focus on node-level contrastive learning, which cannot capture global graph structure. The key challenge to conducting subgraph-level contrastive learning is to sample informative subgraphs that are semantically meaningful. To solve it, we propose to learn graph motifs, which are frequently-occurring subgraph patterns (e.g. functional groups of molecules), for better subgraph sampling. Our framework MotIf-driven Contrastive leaRning Of Graph representations (MICRO-Graph) can: 1) use GNNs to extract motifs from large graph datasets; 2) leverage learned motifs to sample informative subgraphs for contrastive learning of GNN. We formulate motif learning as a differentiable clustering problem, and adopt EM-clustering to group similar and significant subgraphs into several motifs. Guided by these learned motifs, a sampler is trained to generate more informative subgraphs, and these subgraphs are used to train GNNs through graph-to-subgraph contrastive learning. By pre-training on the ogbg-molhiv dataset with MICRO-Graph, the pre-trained GNN achieves 2.04% ROC-AUC average performance enhancement on various downstream benchmark datasets, which is significantly higher than other state-of-the-art self-supervised learning baselines.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Graph Classification | NCI1 | Accuracy74.45 | 460 | |
| Graph Classification | NCI109 | Accuracy76.15 | 223 | |
| Graph Classification | HIV | ROC-AUC0.7673 | 104 | |
| Graph property prediction | Tox21 | ROC-AUC0.7179 | 101 | |
| Graph property prediction | ClinTox | ROC-AUC77.56 | 94 | |
| Graph property prediction | BACE | ROC AUC63.57 | 93 | |
| Graph property prediction | SIDER | ROC AUC60.34 | 87 | |
| Graph property prediction | BBBP | ROC-AUC67.21 | 87 | |
| Graph property prediction | MUV | ROC-AUC0.7046 | 87 | |
| Graph property prediction | ToxCast | ROC-AUC0.608 | 87 |