Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Where Did the Gap Go? Reassessing the Long-Range Graph Benchmark

About

The recent Long-Range Graph Benchmark (LRGB, Dwivedi et al. 2022) introduced a set of graph learning tasks strongly dependent on long-range interaction between vertices. Empirical evidence suggests that on these tasks Graph Transformers significantly outperform Message Passing GNNs (MPGNNs). In this paper, we carefully reevaluate multiple MPGNN baselines as well as the Graph Transformer GPS (Ramp\'a\v{s}ek et al. 2022) on LRGB. Through a rigorous empirical analysis, we demonstrate that the reported performance gap is overestimated due to suboptimal hyperparameter choices. It is noteworthy that across multiple datasets the performance gap completely vanishes after basic hyperparameter optimization. In addition, we discuss the impact of lacking feature normalization for LRGB's vision datasets and highlight a spurious implementation of LRGB's link prediction metric. The principal aim of our paper is to establish a higher standard of empirical rigor within the graph machine learning community.

Jan T\"onshoff, Martin Ritzert, Eran Rosenbluth, Martin Grohe• 2023

Related benchmarks

TaskDatasetResultRank
Graph ClassificationPROTEINS
Accuracy76.8
1252
Node ClassificationChameleon
Accuracy43.6
867
Node ClassificationSquirrel
Accuracy42.2
786
Graph ClassificationNCI1
Accuracy85
658
Node ClassificationCiteseer
Accuracy64.8
503
Graph ClassificationIMDB-B
Accuracy80.2
425
Graph ClassificationIMDB-M
Accuracy47.6
425
Graph ClassificationENZYMES
Accuracy77.7
328
Graph ClassificationDD
Accuracy80.8
300
Graph ClassificationNCI109
Accuracy84.2
267
Showing 10 of 90 rows
...

Other info

Code

Follow for update