Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MDGMIX: Boundary-Aware Subgraph Mixing for Multi-Domain Graph Pre-Training

About

Multi-domain graph pre-training is a crucial step in constructing foundational graph models with cross-domain generalization capabilities. However, existing methods predominantly rely on jointly training all source domain graphs, resulting in high computational costs. Furthermore, it remains unclear whether all source domain graph data contribute equally to effective transfer. This paper empirically reveals significant data redundancy in multi-domain graph pre-training. Based on this finding, we propose the Multi-domain Graph Pre-training Framework, MDGMIX, which combines boundary-aware subgraph mixing with hierarchical discrimination. By selecting boundary nodes to construct challenging mixed-domain subgraphs, MDGMIX employs coarse-grained domain discrimination and fine-grained domain decomposition losses to decouple shared patterns from domain-specific patterns. During adaptation, MDGMIX employs a lightweight prompt weighting mechanism to transfer source domain knowledge. Extensive experiments demonstrate that MDGMIX consistently outperforms strong baselines in few-shot classification tasks while exhibiting superior time and memory efficiency. The code is available at: https://github.com/zhengziyu77/MDGMIX.

Ziyu Zheng, Yaming Yang, Ziyu Guan, Wei Zhao, Xinyan Huang• 2026

Related benchmarks

TaskDatasetResultRank
Node ClassificationChameleon
Accuracy27.2
867
Node ClassificationPubmed
Accuracy67.36
865
Node ClassificationSquirrel
Accuracy22.92
786
Node ClassificationCora
Accuracy46.83
583
Node ClassificationPubmed
Accuracy55.28
363
Node ClassificationPhoto
Accuracy84.01
254
Node ClassificationPhoto
Accuracy68.4
153
Node ClassificationComputers
Accuracy75.57
145
Graph ClassificationPubmed
Accuracy56.42
114
Graph ClassificationCiteseer
Accuracy46.83
112
Showing 10 of 20 rows

Other info

Follow for update