Motif-Driven Contrastive Learning of Graph Representations

About

Pre-training Graph Neural Networks (GNN) via self-supervised contrastive learning has recently drawn lots of attention. However, most existing works focus on node-level contrastive learning, which cannot capture global graph structure. The key challenge to conducting subgraph-level contrastive learning is to sample informative subgraphs that are semantically meaningful. To solve it, we propose to learn graph motifs, which are frequently-occurring subgraph patterns (e.g. functional groups of molecules), for better subgraph sampling. Our framework MotIf-driven Contrastive leaRning Of Graph representations (MICRO-Graph) can: 1) use GNNs to extract motifs from large graph datasets; 2) leverage learned motifs to sample informative subgraphs for contrastive learning of GNN. We formulate motif learning as a differentiable clustering problem, and adopt EM-clustering to group similar and significant subgraphs into several motifs. Guided by these learned motifs, a sampler is trained to generate more informative subgraphs, and these subgraphs are used to train GNNs through graph-to-subgraph contrastive learning. By pre-training on the ogbg-molhiv dataset with MICRO-Graph, the pre-trained GNN achieves 2.04% ROC-AUC average performance enhancement on various downstream benchmark datasets, which is significantly higher than other state-of-the-art self-supervised learning baselines.

Shichang Zhang, Ziniu Hu, Arjun Subramonian, Yizhou Sun• 2020

Related benchmarks

Task	Dataset	Result
Graph Classification	NCI1	Accuracy74.45	658
Graph Classification	NCI109	Accuracy76.15	267
Graph Classification	HIV	ROC-AUC0.7673	155
Graph property prediction	BACE	ROC AUC63.57	111
Graph property prediction	Tox21	ROC-AUC0.7179	109
Graph property prediction	ClinTox	ROC-AUC77.56	102
Graph Regression	Peptides struct (test)	MAE0.332	97
Graph property prediction	SIDER	ROC AUC60.34	95
Graph property prediction	MUV	ROC-AUC0.7046	95
Graph property prediction	ToxCast	ROC-AUC0.608	95

Showing 10 of 29 rows

Other info

Follow for update

@wizwand_team Discord