Can Graph Neural Networks Learn Language with Extremely Weak Text Supervision?

About

While great success has been achieved in building vision models with Contrastive Language-Image Pre-training (CLIP) over internet-scale image-text pairs, building transferable Graph Neural Networks (GNNs) with CLIP pipeline is challenging because of the scarcity of labeled data and text supervision, different levels of downstream tasks, and the conceptual gaps between domains. In this work, to address these issues, we propose a multi-modal prompt learning paradigm to effectively adapt pre-trained GNN to downstream tasks and data, given only a few semantically labeled samples, each with extremely weak text supervision. Our new paradigm embeds the graphs directly in the same space as the Large Language Models (LLMs) by learning both graph prompts and text prompts simultaneously. We demonstrate the superior performance of our paradigm in few-shot, multi-task-level, and cross-domain settings. Moreover, we build the first CLIP-style zero-shot classification prototype that can generalize GNNs to unseen classes with extremely weak text supervision. The code is available at https://github.com/Violet24K/Morpher.

Zihao Li, Lecheng Zheng, Bowen Jin, Dongqi Fu, Baoyu Jing, Yikun Ban, Jingrui He, Jiawei Han• 2024

Related benchmarks

Task	Dataset	Result
Graph Classification	PROTEINS	Accuracy73.53	1252
Graph Classification	MUTAG	Accuracy79.33	1103
Graph Classification	Mutag (test)	Accuracy76.67	224
Molecular Property Classification	MoleculeNet BACE	ROC AUC68.58	47
Node Classification	Citeseer	Accuracy0.682	41
Graph-level classification	MUTAG (target)	Accuracy76.67	10
Node-level classification	PubMed (target)	Accuracy58.29	10
Graph Classification	MoleculeNet tox21	AUC-ROC0.7459	8
Graph Classification	MoleculeNet HIV	AUC ROC72.83	8
Graph Classification	MSRC 21C	Accuracy50.85	7

Showing 10 of 13 rows

Other info

Follow for update

@wizwand_team Discord