Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

SAMGPT: Text-free Graph Foundation Model for Multi-domain Pre-training and Cross-domain Adaptation

About

Graphs are able to model interconnected entities in many online services, supporting a wide range of applications on the Web. This raises an important question: How can we train a graph foundational model on multiple source domains and adapt to an unseen target domain? A major obstacle is that graphs from different domains often exhibit divergent characteristics. Some studies leverage large language models to align multiple domains based on textual descriptions associated with the graphs, limiting their applicability to text-attributed graphs. For text-free graphs, a few recent works attempt to align different feature distributions across domains, while generally neglecting structural differences. In this work, we propose a novel Structure Alignment framework for text-free Multi-domain Graph Pre-Training and cross-domain adaptation (SAMGPT). It is designed to learn multi-domain knowledge from graphs originating in multiple source domains, which can then be adapted to address applications in an unseen target domain. Specifically, we introduce a set of structure tokens to harmonize structure-based aggregation across source domains during the pre-training phase. Next, for cross-domain adaptation, we design dual prompts, namely, holistic prompts and specific prompts, which adapt unified multi-domain structural knowledge and fine-grained, domain-specific information, respectively, to a target domain. Finally, we conduct comprehensive experiments on seven public datasets to evaluate and analyze the effectiveness of SAMGPT.

Xingtong Yu, Zechuan Gong, Chang Zhou, Yuan Fang, Hui Zhang• 2025

Related benchmarks

TaskDatasetResultRank
Graph ClassificationPROTEINS
Accuracy70.48
994
Node ClassificationPubmed
Accuracy59.1
396
Node ClassificationCiteseer
Accuracy56.4
393
Node ClassificationwikiCS
Accuracy54.4
317
Node ClassificationarXiv
Accuracy38.14
219
Node ClassificationREDDIT
Accuracy75.12
192
Node ClassificationComputers
Mean Accuracy60.87
169
Graph ClassificationPubmed
Accuracy68
101
Graph ClassificationCiteseer
Accuracy62.4
99
Graph ClassificationWiki CS
Accuracy48.3
96
Showing 10 of 18 rows

Other info

Follow for update