Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Scale Equivariant Graph Metanetworks

About

This paper pertains to an emerging machine learning paradigm: learning higher-order functions, i.e. functions whose inputs are functions themselves, $\textit{particularly when these inputs are Neural Networks (NNs)}$. With the growing interest in architectures that process NNs, a recurring design principle has permeated the field: adhering to the permutation symmetries arising from the connectionist structure of NNs. $\textit{However, are these the sole symmetries present in NN parameterizations}$? Zooming into most practical activation functions (e.g. sine, ReLU, tanh) answers this question negatively and gives rise to intriguing new symmetries, which we collectively refer to as $\textit{scaling symmetries}$, that is, non-zero scalar multiplications and divisions of weights and biases. In this work, we propose $\textit{Scale Equivariant Graph MetaNetworks - ScaleGMNs}$, a framework that adapts the Graph Metanetwork (message-passing) paradigm by incorporating scaling symmetries and thus rendering neuron and edge representations equivariant to valid scalings. We introduce novel building blocks, of independent technical interest, that allow for equivariance or invariance with respect to individual scalar multipliers or their product and use them in all components of ScaleGMN. Furthermore, we prove that, under certain expressivity conditions, ScaleGMN can simulate the forward and backward pass of any input feedforward neural network. Experimental results demonstrate that our method advances the state-of-the-art performance for several datasets and activation functions, highlighting the power of scaling symmetries as an inductive bias for NN processing. The source code is publicly available at https://github.com/jkalogero/scalegmn.

Ioannis Kalogeropoulos, Giorgos Bouritsas, Yannis Panagakis• 2024

Related benchmarks

TaskDatasetResultRank
Image ClassificationMNIST (test)
Accuracy96.59
882
Image ClassificationFashion MNIST (test)
Accuracy80.78
568
ClassificationCIFAR10 (test)
Accuracy38.82
266
INR classificationF-MNIST Implicit Neural Representations (test)
Accuracy80.78
15
Weight-space INR classificationMNIST (test)
Test Accuracy96.59
13
INR DilationMNIST INR
MSE1.89
8
INR classificationCIFAR-10 Implicit Neural Representations (test)
Accuracy38.82
7
INR classificationAugmented CIFAR-10 Implicit Neural Representations (test)
Accuracy56.95
7
Generalization predictionSmallCNN Zoo CIFAR-10-GS ReLU (test)
Kendall's Tau0.9282
6
Generalization predictionSmallCNN Zoo SVHN-GS ReLU (test)
Kendall's Tau0.8689
6
Showing 10 of 13 rows

Other info

Follow for update