
Where Did the Gap Go? Reassessing the Long-Range Graph Benchmark

About

The recent Long-Range Graph Benchmark (LRGB, Dwivedi et al. 2022) introduced a set of graph learning tasks strongly dependent on long-range interaction between vertices. Empirical evidence suggests that on these tasks Graph Transformers significantly outperform Message Passing GNNs (MPGNNs). In this paper, we carefully reevaluate multiple MPGNN baselines as well as the Graph Transformer GPS (Rampášek et al. 2022) on LRGB. Through a rigorous empirical analysis, we demonstrate that the reported performance gap is overestimated due to suboptimal hyperparameter choices. It is noteworthy that across multiple datasets the performance gap completely vanishes after basic hyperparameter optimization. In addition, we discuss the impact of lacking feature normalization for LRGB's vision datasets and highlight a spurious implementation of LRGB's link prediction metric. The principal aim of our paper is to establish a higher standard of empirical rigor within the graph machine learning community.
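The missing feature normalization the abstract mentions can be illustrated with a minimal sketch: standardize each node-feature dimension using statistics computed on the training split only, then apply the same transform to every split. This is a generic z-score normalization sketch, not the paper's exact preprocessing code; the function name and NumPy-based setup are assumptions for illustration.

```python
import numpy as np

def normalize_features(train_x: np.ndarray, x: np.ndarray) -> np.ndarray:
    """Standardize features of `x` using per-dimension statistics
    computed from the training split `train_x` only.

    Note: illustrative sketch; the LRGB codebase may normalize differently.
    """
    mean = train_x.mean(axis=0)
    std = train_x.std(axis=0)
    std[std == 0] = 1.0  # guard against constant feature dimensions
    return (x - mean) / std
```

Fitting the statistics on the training split alone avoids leaking information from validation or test data into the preprocessing step.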

Jan Tönshoff, Martin Ritzert, Eran Rosenbluth, Martin Grohe • 2023

Related benchmarks

Task                  Dataset      Metric    Result  Rank
Graph Classification  PROTEINS     Accuracy  76.8    742
Node Classification   Chameleon    Accuracy  43.6    549
Node Classification   Squirrel     Accuracy  42.2    500
Graph Classification  NCI1         Accuracy  85      460
Graph Classification  IMDB-B       Accuracy  80.2    322
Graph Classification  ENZYMES      Accuracy  77.7    305
Node Classification   Citeseer     Accuracy  64.8    275
Graph Classification  NCI109       Accuracy  84.2    223
Graph Classification  IMDB-M       Accuracy  47.6    218
Graph Regression      ZINC (test)  MAE       0.024   204
Showing 10 of 74 rows
...
