Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

SAMGPT: Text-free Graph Foundation Model for Multi-domain Pre-training and Cross-domain Adaptation

About

Graphs are able to model interconnected entities in many online services, supporting a wide range of applications on the Web. This raises an important question: How can we train a graph foundational model on multiple source domains and adapt to an unseen target domain? A major obstacle is that graphs from different domains often exhibit divergent characteristics. Some studies leverage large language models to align multiple domains based on textual descriptions associated with the graphs, limiting their applicability to text-attributed graphs. For text-free graphs, a few recent works attempt to align different feature distributions across domains, while generally neglecting structural differences. In this work, we propose a novel Structure Alignment framework for text-free Multi-domain Graph Pre-Training and cross-domain adaptation (SAMGPT). It is designed to learn multi-domain knowledge from graphs originating in multiple source domains, which can then be adapted to address applications in an unseen target domain. Specifically, we introduce a set of structure tokens to harmonize structure-based aggregation across source domains during the pre-training phase. Next, for cross-domain adaptation, we design dual prompts, namely, holistic prompts and specific prompts, which adapt unified multi-domain structural knowledge and fine-grained, domain-specific information, respectively, to a target domain. Finally, we conduct comprehensive experiments on seven public datasets to evaluate and analyze the effectiveness of SAMGPT.

Xingtong Yu, Zechuan Gong, Chang Zhou, Yuan Fang, Hui Zhang• 2025

Related benchmarks

TaskDatasetResultRank
Graph ClassificationPROTEINS
Accuracy70.48
1252
Node ClassificationCiteseer
Accuracy46.46
1037
Node ClassificationChameleon
Accuracy38.12
867
Node ClassificationPubmed
Accuracy59.4
865
Node ClassificationWisconsin
Accuracy52.29
864
Node ClassificationCornell
Accuracy59.34
851
Node ClassificationTexas
Accuracy0.3135
801
Node ClassificationSquirrel
Accuracy25.75
786
Node ClassificationPubmed
Accuracy59.1
627
Node ClassificationCora
Accuracy62.76
583
Showing 10 of 71 rows
...

Other info

Follow for update