Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

GraphMAE: Self-Supervised Masked Graph Autoencoders

About

Self-supervised learning (SSL) has been extensively explored in recent years. Particularly, generative SSL has seen emerging success in natural language processing and other AI fields, such as the wide adoption of BERT and GPT. Despite this, contrastive learning-which heavily relies on structural data augmentation and complicated training strategies-has been the dominant approach in graph SSL, while the progress of generative SSL on graphs, especially graph autoencoders (GAEs), has thus far not reached the potential as promised in other fields. In this paper, we identify and examine the issues that negatively impact the development of GAEs, including their reconstruction objective, training robustness, and error metric. We present a masked graph autoencoder GraphMAE that mitigates these issues for generative self-supervised graph pretraining. Instead of reconstructing graph structures, we propose to focus on feature reconstruction with both a masking strategy and scaled cosine error that benefit the robust training of GraphMAE. We conduct extensive experiments on 21 public datasets for three different graph learning tasks. The results manifest that GraphMAE-a simple graph autoencoder with careful designs-can consistently generate outperformance over both contrastive and generative state-of-the-art baselines. This study provides an understanding of graph autoencoders and demonstrates the potential of generative self-supervised pre-training on graphs.

Zhenyu Hou, Xiao Liu, Yukuo Cen, Yuxiao Dong, Hongxia Yang, Chunjie Wang, Jie Tang• 2022

Related benchmarks

TaskDatasetResultRank
Node ClassificationCora
Accuracy84.2
885
Node ClassificationCiteseer
Accuracy73.4
804
Graph ClassificationPROTEINS
Accuracy75.3
742
Node ClassificationPubmed
Accuracy81.1
742
Graph ClassificationMUTAG
Accuracy88.2
697
Node ClassificationCora (test)
Mean Accuracy73.1
687
Graph ClassificationNCI1
Accuracy80.4
460
Node ClassificationCornell
Accuracy50.8
426
Link PredictionFB15k-237 (test)--
419
Node ClassificationTexas
Accuracy0.659
410
Showing 10 of 92 rows
...

Other info

Code

Follow for update