Weisfeiler and Leman go sparse: Towards scalable higher-order graph embeddings

About

Graph kernels based on the $1$-dimensional Weisfeiler-Leman algorithm and corresponding neural architectures recently emerged as powerful tools for (supervised) learning with graphs. However, due to the purely local nature of the algorithms, they might miss essential patterns in the given data and can only handle binary relations. The $k$-dimensional Weisfeiler-Leman algorithm addresses this by considering $k$-tuples, defined over the set of vertices, and defines a suitable notion of adjacency between these vertex tuples. Hence, it accounts for the higher-order interactions between vertices. However, it does not scale and may suffer from overfitting when used in a machine learning setting. Hence, it remains an important open problem to design WL-based graph learning methods that are simultaneously expressive, scalable, and non-overfitting. Here, we propose local variants and corresponding neural architectures, which consider a subset of the original neighborhood, making them more scalable, and less prone to overfitting. The expressive power of (one of) our algorithms is strictly higher than the original algorithm, in terms of ability to distinguish non-isomorphic graphs. Our experimental study confirms that the local algorithms, both kernel and neural architectures, lead to vastly reduced computation times, and prevent overfitting. The kernel version establishes a new state-of-the-art for graph classification on a wide range of benchmark datasets, while the neural version shows promising performance on large-scale molecular regression tasks.
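The contrast between the $1$-dimensional algorithm and a local tuple-based variant can be illustrated with a small sketch. The code below is not the authors' implementation. It is a simplified illustration for $k = 2$ on unlabeled graphs, in which a 2-tuple only aggregates colors from tuples obtained by replacing one component with a graph neighbor of that component. The function names (`wl1_histogram`, `local_wl2_histogram`, `cycle`), the fixed number of rounds, and the use of nested tuples as colors are assumptions made for readability, not details taken from the paper.

```python
from collections import Counter
from itertools import product


def wl1_histogram(adj, rounds=3):
    """1-WL colour refinement; returns a histogram of final vertex colours."""
    colour = {v: 0 for v in adj}  # unlabeled graph: uniform start colour
    for _ in range(rounds):
        # New colour = old colour plus the sorted multiset of neighbour colours.
        colour = {
            v: (colour[v], tuple(sorted(colour[w] for w in adj[v])))
            for v in adj
        }
    return Counter(colour.values())


def local_wl2_histogram(adj, rounds=3):
    """Simplified local refinement on 2-tuples (an illustrative assumption).

    A tuple (u, v) only looks at tuples reached by swapping one component
    for a graph neighbour of that component, not for every vertex.
    """
    nodes = list(adj)
    # Start colour: is the tuple a repeated vertex, and is it an edge?
    colour = {
        (u, v): (u == v, v in adj[u])
        for u, v in product(nodes, repeat=2)
    }
    for _ in range(rounds):
        new = {}
        for (u, v), c in colour.items():
            m1 = tuple(sorted(colour[(w, v)] for w in adj[u]))  # replace 1st entry
            m2 = tuple(sorted(colour[(u, w)] for w in adj[v]))  # replace 2nd entry
            new[(u, v)] = (c, m1, m2)
        colour = new
    return Counter(colour.values())


def cycle(n, offset=0):
    """Adjacency dict of an n-cycle on vertices offset .. offset + n - 1."""
    return {
        offset + i: {offset + (i - 1) % n, offset + (i + 1) % n}
        for i in range(n)
    }


hexagon = cycle(6)
two_triangles = {**cycle(3), **cycle(3, offset=3)}

same_under_wl1 = wl1_histogram(hexagon) == wl1_histogram(two_triangles)
same_under_local_wl2 = (
    local_wl2_histogram(hexagon) == local_wl2_histogram(two_triangles)
)
print(same_under_wl1, same_under_local_wl2)
```

Both graphs are 2-regular on six vertices, so the vertex-level refinement assigns them identical color histograms, while the tuple-level sketch separates them after one round. Each tuple has only as many local neighbors as its two components have graph neighbors, which is the source of the reduced cost on sparse graphs, although the number of tuples itself still grows as $n^k$.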

Christopher Morris, Gaurav Rattan, Petra Mutzel • 2019

Related benchmarks

Task	Dataset	Result
Graph Classification	PROTEINS	Accuracy 79.3	1252
Graph Classification	NCI1	Accuracy 91.4	658
Graph Classification	IMDB-B	Accuracy 76.2	425
Graph Classification	ENZYMES	Accuracy 57.6	328
Graph Classification	NCI109	Accuracy 89.3	267
Graph Classification	PROTEINS (test)	Accuracy 75.1	213
Graph Classification	IMDB-B (test)	Accuracy 73.3	155
Graph Classification	IMDB MULTI	Accuracy 64.2	139
Graph Classification	REDDIT BINARY	Accuracy 91.1	124
Graph Classification	PTC FM	Accuracy 62.6	70

Showing 10 of 25 rows
