Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

GFT: Graph Foundation Model with Transferable Tree Vocabulary

About

Inspired by the success of foundation models in applications such as ChatGPT, as graph data has been ubiquitous, one can envision the far-reaching impacts that can be brought by Graph Foundation Models (GFMs) with broader applications in the areas such as scientific research, social network analysis, drug discovery, and e-commerce. Despite the significant progress of pre-trained graph neural networks, there haven't been GFMs that can achieve desired performance on various graph-learning-related tasks. Building GFMs may rely on a vocabulary that encodes transferable patterns shared among different tasks and domains. Unlike image and text, defining such transferable patterns for graphs remains an open question. In this paper, we aim to bridge this gap by rethinking the transferable patterns on graphs as computation trees -- i.e., tree structures derived from the message-passing process. Based on this insight, we propose a cross-task, cross-domain graph foundation model named GFT, short for Graph Foundation model with transferable Tree vocabulary. By treating computation trees as tokens within the transferable vocabulary, GFT improves model generalization and reduces the risk of negative transfer. The theoretical analyses and extensive experimental studies have demonstrated the transferability of computation trees and shown the effectiveness of GFT across diverse tasks and domains in graph learning. The open source code and data are available at https://github.com/Zehong-Wang/GFT.

Zehong Wang, Zheyuan Zhang, Nitesh V Chawla, Chuxu Zhang, Yanfang Ye• 2024

Related benchmarks

TaskDatasetResultRank
Node ClassificationCora
Accuracy74.7
885
Node ClassificationCora (test)
Mean Accuracy77.83
687
Link PredictionFB15k-237 (test)--
419
Link PredictionWN18RR (test)--
380
Node ClassificationPubmed
Accuracy84.79
307
Node ClassificationwikiCS
Accuracy78.37
198
Node ClassificationOgbn-arxiv
Accuracy73.58
191
Graph ClassificationHIV
ROC-AUC0.6757
104
Node ClassificationCora Covariate shift (degree split)
OOD Accuracy87.01
50
Node ClassificationPubMed (test)
Accuracy77.7
20
Showing 10 of 40 rows

Other info

Follow for update