Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

$P^2$GNN: Two Prototype Sets to boost GNN Performance

About

Message Passing Graph Neural Networks (MP-GNNs) have garnered attention for addressing various industry challenges, such as user recommendation and fraud detection. However, they face two major hurdles: (1) heavy reliance on local context, often lacking information about the global context or graph-level features, and (2) assumption of strong homophily among connected nodes, struggling with noisy local neighborhoods. To tackle these, we introduce $P^2$GNN, a plug-and-play technique leveraging prototypes to optimize message passing, enhancing the performance of the base GNN model. Our approach views the prototypes in two ways: (1) as universally accessible neighbors for all nodes, enriching global context, and (2) aligning messages to clustered prototypes, offering a denoising effect. We demonstrate the extensibility of our proposed method to all message-passing GNNs and conduct extensive experiments across 18 datasets, including proprietary e-commerce datasets and open-source datasets, on node recommendation and node classification tasks. Results show that $P^2$GNN outperforms production models in e-commerce and achieves the top average rank on open-source datasets, establishing it as a leading approach. Qualitative analysis supports the value of global context and noise mitigation in the local neighborhood in enhancing performance.

Arihant Jain, Gundeep Arora, Anoop Saladi, Chaosheng Dong• 2026

Related benchmarks

TaskDatasetResultRank
Node ClassificationarXiv-year
Accuracy54.88
112
Node ClassificationCornell (60%/20%/20% random)
Accuracy95.41
95
Node ClassificationCora (60/20/20 random split)
Accuracy89.89
91
Node ClassificationTexas (48/32/20)
Mean Accuracy88.65
78
Node ClassificationChameleon (60%/20%/20% random)
Accuracy69.87
72
Node ClassificationWisconsin (48/32/20)
Mean Accuracy88.43
66
Node ClassificationCornell (48/32/20)
Mean Accuracy86.49
66
Node ClassificationCiteseer (48/32/20)
Mean Accuracy (%)77.51
66
Node ClassificationTexas (60% 20% 20% random splits)
Accuracy96.72
62
Node ClassificationChameleon (48/32/20)
Mean Accuracy78.6
49
Showing 10 of 24 rows

Other info

Follow for update