Heterogeneous Graph Masked Autoencoders

About

Generative self-supervised learning (SSL), especially masked autoencoders, has become one of the most exciting learning paradigms and has shown great potential in handling graph data. However, real-world graphs are always heterogeneous, which poses three critical challenges that existing methods ignore: (1) how to capture complex graph structure, (2) how to incorporate various node attributes, and (3) how to encode different node positions. In light of this, we study the problem of generative SSL on heterogeneous graphs and propose HGMAE, a novel heterogeneous graph masked autoencoder model that addresses these challenges. HGMAE captures comprehensive graph information via two innovative masking techniques and three unique training strategies. In particular, we first develop metapath masking and adaptive attribute masking with a dynamic mask rate to enable effective and stable learning on heterogeneous graphs. We then design several training strategies: metapath-based edge reconstruction to capture complex structural information, target attribute restoration to incorporate various node attributes, and positional feature prediction to encode node positional information. Extensive experiments demonstrate that HGMAE outperforms both contrastive and generative state-of-the-art baselines on several tasks across multiple datasets. Code is available at https://github.com/meettyj/HGMAE.

Yijun Tian, Kaiwen Dong, Chunhui Zhang, Chuxu Zhang, Nitesh V. Chawla • 2022
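
To make the idea of attribute masking with a dynamic mask rate more concrete, here is a minimal NumPy sketch. It is not taken from the authors' code: the linear schedule, the zero-vector mask token, the cosine-based restoration loss, and all function names and default values (`dynamic_mask_rate`, `mask_attributes`, `masked_cosine_loss`, `start=0.5`, `end=0.8`) are illustrative assumptions.

```python
import numpy as np


def dynamic_mask_rate(epoch, total_epochs, start=0.5, end=0.8):
    """Hypothetical linear schedule: the mask rate grows from `start` to `end`."""
    if total_epochs <= 1:
        return end
    frac = min(max(epoch / (total_epochs - 1), 0.0), 1.0)
    return start + frac * (end - start)


def mask_attributes(x, rate, rng):
    """Replace a random subset of node attribute rows with a zero mask token.

    Returns the masked copy and the indices of the masked nodes.
    """
    num_nodes = x.shape[0]
    num_masked = int(round(rate * num_nodes))
    masked_idx = rng.permutation(num_nodes)[:num_masked]
    x_masked = x.copy()
    x_masked[masked_idx] = 0.0  # assumed mask token: all zeros
    return x_masked, masked_idx


def masked_cosine_loss(x, x_rec, masked_idx, eps=1e-8):
    """Mean (1 - cosine similarity) between original and restored attributes,
    computed over the masked nodes only."""
    if len(masked_idx) == 0:
        return 0.0
    a = x[masked_idx]
    b = x_rec[masked_idx]
    norms = np.linalg.norm(a, axis=1) * np.linalg.norm(b, axis=1) + eps
    cos = (a * b).sum(axis=1) / norms
    return float(np.mean(1.0 - cos))


# Toy usage: 10 nodes with 4-dimensional attributes.
rng = np.random.default_rng(0)
x = rng.normal(size=(10, 4))
rate = dynamic_mask_rate(epoch=0, total_epochs=5)
x_masked, masked_idx = mask_attributes(x, rate, rng)
# A real model would encode x_masked and decode a restoration; passing x itself
# stands in for a perfect restoration here.
loss = masked_cosine_loss(x, x, masked_idx)
```

In an actual training loop, the restored attributes would come from an encoder-decoder applied to `x_masked`, and the loss would be evaluated only on `masked_idx` so that the model is rewarded for restoring what it could not see.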

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Node Classification | ACM | Macro-F1 | 90.5 | 104 |
| Node Classification | DBLP | Micro-F1 | 93.4 | 94 |
| Node Clustering | ACM | ARI | 72.6 | 57 |
| Node Classification | Aminer | Micro-F1 | 80.3 | 46 |
| Node Classification | Freebase | Macro-F1 | 35.15 | 43 |
| Node Clustering | DBLP | NMI | 76.9 | 39 |
| Drug trafficking detection | Twitter-HetDrug 10% label setting (train) | Macro-F1 | 61.84 | 24 |
| Drug trafficking detection | Twitter-HetDrug 40% label setting (train) | Macro-F1 | 65.74 | 24 |
| Drug trafficking detection | Twitter-HetDrug 20% label setting (train) | Macro-F1 | 62.53 | 24 |
| Node Classification | Yelp | Macro-F1 | 90.5 | 15 |
Showing 10 of 12 rows
