Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Large-Scale Representation Learning on Graphs via Bootstrapping

About

Self-supervised learning provides a promising path towards eliminating the need for costly label information in representation learning on graphs. However, to achieve state-of-the-art performance, methods often need large numbers of negative examples and rely on complex augmentations. This can be prohibitively expensive, especially for large graphs. To address these challenges, we introduce Bootstrapped Graph Latents (BGRL) - a graph representation learning method that learns by predicting alternative augmentations of the input. BGRL uses only simple augmentations and alleviates the need for contrasting with negative examples, and is thus scalable by design. BGRL outperforms or matches prior methods on several established benchmarks, while achieving a 2-10x reduction in memory costs. Furthermore, we show that BGRL can be scaled up to extremely large graphs with hundreds of millions of nodes in the semi-supervised regime - achieving state-of-the-art performance and improving over supervised baselines where representations are shaped only through label information. In particular, our solution centered on BGRL constituted one of the winning entries to the Open Graph Benchmark - Large Scale Challenge at KDD Cup 2021, on a graph orders of magnitudes larger than all previously available benchmarks, thus demonstrating the scalability and effectiveness of our approach.

Shantanu Thakoor, Corentin Tallec, Mohammad Gheshlaghi Azar, Mehdi Azabou, Eva L. Dyer, R\'emi Munos, Petar Veli\v{c}kovi\'c, Michal Valko• 2021

Related benchmarks

TaskDatasetResultRank
Node ClassificationCora
Accuracy82.7
1215
Graph ClassificationPROTEINS
Accuracy70.99
994
Node ClassificationCiteseer
Accuracy71.1
931
Graph ClassificationMUTAG
Accuracy74.99
862
Node ClassificationCora (test)
Mean Accuracy84.45
861
Node ClassificationCiteseer (test)
Accuracy0.7484
824
Node ClassificationPubmed
Accuracy79.6
819
Node ClassificationChameleon
Accuracy42.55
640
Node ClassificationWisconsin
Accuracy51.23
627
Node ClassificationTexas
Accuracy0.5277
616
Showing 10 of 63 rows

Other info

Follow for update