GiGaMAE: Generalizable Graph Masked Autoencoder via Collaborative Latent Space Reconstruction

About

Self-supervised learning with masked autoencoders has recently gained popularity for its ability to produce effective image or textual representations, which can be applied to various downstream tasks without retraining. However, we observe that current masked autoencoder models lack good generalization ability on graph data. To tackle this issue, we propose a novel graph masked autoencoder framework called GiGaMAE. Unlike existing masked autoencoders, which learn node representations by explicitly reconstructing the original graph components (e.g., features or edges), we propose to collaboratively reconstruct informative and integrated latent embeddings. By using embeddings that encompass graph topology and attribute information as reconstruction targets, our model can capture more generalized and comprehensive knowledge. Furthermore, we introduce a mutual-information-based reconstruction loss that enables the effective reconstruction of multiple targets. This learning objective allows us to differentiate between the exclusive knowledge learned from a single target and the common knowledge shared by multiple targets. We evaluate our method on three downstream tasks with seven datasets as benchmarks. Extensive experiments demonstrate the superiority of GiGaMAE over state-of-the-art baselines. We hope our results will shed light on the design of foundation models for graph-structured data. Our code is available at: https://github.com/sycny/GiGaMAE.

Yucheng Shi, Yushun Dong, Qiaoyu Tan, Jundong Li, Ninghao Liu • 2023
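
The abstract describes reconstructing latent target embeddings (one carrying topology, one carrying attributes) under a mutual-information-based loss. The sketch below is only an illustration of that general idea, not the authors' objective: it swaps in InfoNCE, a commonly used lower-bound surrogate for mutual information, and simply sums one term per target. It does not reproduce the paper's separation of exclusive and common knowledge. All names (`info_nce`, `multi_target_loss`, `temperature`, the random arrays standing in for embeddings) are assumptions made for this example; the actual implementation is in the linked repository.

```python
import numpy as np


def info_nce(pred, target, temperature=0.5):
    """InfoNCE loss between predicted and target embeddings.

    Row i of `pred` is the positive pair of row i of `target`;
    every other row of `target` acts as a negative.
    """
    # Cosine similarity: L2-normalize rows, then take dot products.
    pred = pred / np.linalg.norm(pred, axis=1, keepdims=True)
    target = target / np.linalg.norm(target, axis=1, keepdims=True)
    logits = pred @ target.T / temperature  # shape (n, n)

    # Numerically stable row-wise log-softmax.
    logits = logits - logits.max(axis=1, keepdims=True)
    log_prob = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))

    # Negative log-probability of the matching (diagonal) pairs.
    return float(-np.mean(np.diag(log_prob)))


def multi_target_loss(pred_topo, pred_attr, target_topo, target_attr,
                      temperature=0.5):
    """Hypothetical multi-target objective: one InfoNCE term per target."""
    return (info_nce(pred_topo, target_topo, temperature)
            + info_nce(pred_attr, target_attr, temperature))


# Toy usage with random arrays in place of real embeddings (assumption).
rng = np.random.default_rng(0)
n_nodes, dim = 16, 32
target_topo = rng.normal(size=(n_nodes, dim))  # stand-in topology target
target_attr = rng.normal(size=(n_nodes, dim))  # stand-in attribute target

# Only masked nodes are scored, as in masked-autoencoder training.
mask = np.zeros(n_nodes, dtype=bool)
mask[: n_nodes // 2] = True

# A perfect reconstruction versus an unrelated random prediction.
loss_aligned = multi_target_loss(target_topo[mask], target_attr[mask],
                                 target_topo[mask], target_attr[mask])
noise = rng.normal(size=(int(mask.sum()), dim))
loss_random = multi_target_loss(noise, noise,
                                target_topo[mask], target_attr[mask])
```

In this toy setup the loss for predictions that match their targets comes out lower than the loss for unrelated predictions, which is the behaviour a reconstruction objective of this kind relies on.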

Related benchmarks

Task	Dataset	Result
Node Classification	Citeseer	Accuracy 72.31	1037
Node Classification	Photo	Accuracy 93.5	285
Node Classification	Computer	Accuracy 89.7	186
Node Clustering	Cora	NMI 55.7	179
Node Classification	Cora	Accuracy 84.72	125
Link Prediction	PubMed (test)	AUC 97.5	120
Link Prediction	Cora (test)	AUC 0.935	117
Node Classification	Citeseer	Accuracy 0.698	63
Graph Clustering	Pubmed	NMI 34	61
Node Classification	CS	Accuracy 92.4	61

Showing 10 of 16 rows
