Quasi-Equivariant Metanetworks

About

Metanetworks are neural architectures designed to operate directly on pretrained weights to perform downstream tasks. However, the parameter space serves only as a proxy for the underlying function class, and the parameter-function mapping is inherently non-injective: distinct parameter configurations may yield identical input-output behaviors. As a result, metanetworks that rely solely on raw parameters risk overlooking the intrinsic symmetries of the architecture. Reasoning about functional identity is therefore essential for effective metanetwork design, motivating the development of equivariant metanetworks, which incorporate equivariance principles to respect architectural symmetries. Existing approaches, however, typically enforce strict equivariance, which imposes rigid constraints and often leads to sparse and less expressive models. To address this limitation, we introduce the novel concept of quasi-equivariance, which allows metanetworks to move beyond the rigidity of strict equivariance while still preserving functional identity. We lay down a principled basis for this framework and demonstrate its broad applicability across diverse neural architectures, including feedforward, convolutional, and transformer networks. Through empirical evaluation, we show that quasi-equivariant metanetworks achieve good trade-offs between symmetry preservation and representational expressivity. These findings advance the theoretical understanding of weight-space learning and provide a principled foundation for the design of more expressive and functionally robust metanetworks.

Viet-Hoang Tran, An Nguyen, Beno\^it Gu\'erand, Thieu N. Vo, Tan M. Nguyen• 2026

Related benchmarks

Task	Dataset	Result
Performance Prediction	Small CNN Zoo ReLU subset (test)	Kendall’s Tau0.926	35
INR classification	F-MNIST Implicit Neural Representations (test)	Accuracy62.11	21
INR classification	MNIST (test)	Accuracy70.21	18
Weight-space INR classification	CIFAR-10 (test)	Test Accuracy35.32	15
INR classification	CIFAR-10 (test)	Accuracy35.32	13
INR editing (dilate)	MNIST (test)	MSE0.066	13
Predicting Transformer Generalization	MNIST-Transformers No threshold	Kendall's Tau0.911	8
Predicting Transformer Generalization	MNIST-Transformers 20% threshold	Kendall's tau0.905	8
Predicting Transformer Generalization	MNIST-Transformers 40% threshold	Kendall's tau0.898	8
Predicting Transformer Generalization	MNIST-Transformers (80% threshold)	Kendall's Tau0.892	8

Showing 10 of 19 rows

Other info

Follow for update

@wizwand_team Discord