Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

GFT: Graph Foundation Model with Transferable Tree Vocabulary

About

Inspired by the success of foundation models in applications such as ChatGPT, as graph data has been ubiquitous, one can envision the far-reaching impacts that can be brought by Graph Foundation Models (GFMs) with broader applications in the areas such as scientific research, social network analysis, drug discovery, and e-commerce. Despite the significant progress of pre-trained graph neural networks, there haven't been GFMs that can achieve desired performance on various graph-learning-related tasks. Building GFMs may rely on a vocabulary that encodes transferable patterns shared among different tasks and domains. Unlike image and text, defining such transferable patterns for graphs remains an open question. In this paper, we aim to bridge this gap by rethinking the transferable patterns on graphs as computation trees -- i.e., tree structures derived from the message-passing process. Based on this insight, we propose a cross-task, cross-domain graph foundation model named GFT, short for Graph Foundation model with transferable Tree vocabulary. By treating computation trees as tokens within the transferable vocabulary, GFT improves model generalization and reduces the risk of negative transfer. The theoretical analyses and extensive experimental studies have demonstrated the transferability of computation trees and shown the effectiveness of GFT across diverse tasks and domains in graph learning. The open source code and data are available at https://github.com/Zehong-Wang/GFT.

Zehong Wang, Zheyuan Zhang, Nitesh V Chawla, Chuxu Zhang, Yanfang Ye• 2024

Related benchmarks

TaskDatasetResultRank
Node ClassificationCora
Accuracy74.7
1215
Graph ClassificationPROTEINS
Accuracy74.69
994
Node ClassificationCora (test)
Mean Accuracy77.83
861
Link PredictionFB15k-237 (test)--
419
Node ClassificationPubmed
Accuracy84.79
396
Link PredictionWN18RR (test)--
380
Node ClassificationwikiCS
Accuracy78.37
317
Node ClassificationarXiv
Accuracy39.02
219
Node ClassificationOgbn-arxiv
Accuracy73.58
206
Node ClassificationREDDIT
Accuracy71.37
192
Showing 10 of 54 rows

Other info

Follow for update