Quasi-Equivariant Metanetworks

About

Metanetworks are neural architectures designed to operate directly on pretrained weights to perform downstream tasks. However, the parameter space serves only as a proxy for the underlying function class, and the parameter-function mapping is inherently non-injective: distinct parameter configurations may yield identical input-output behaviors. As a result, metanetworks that rely solely on raw parameters risk overlooking the intrinsic symmetries of the architecture. Reasoning about functional identity is therefore essential for effective metanetwork design, motivating the development of equivariant metanetworks, which incorporate equivariance principles to respect architectural symmetries. Existing approaches, however, typically enforce strict equivariance, which imposes rigid constraints and often leads to sparse and less expressive models. To address this limitation, we introduce the novel concept of quasi-equivariance, which allows metanetworks to move beyond the rigidity of strict equivariance while still preserving functional identity. We lay down a principled basis for this framework and demonstrate its broad applicability across diverse neural architectures, including feedforward, convolutional, and transformer networks. Through empirical evaluation, we show that quasi-equivariant metanetworks achieve good trade-offs between symmetry preservation and representational expressivity. These findings advance the theoretical understanding of weight-space learning and provide a principled foundation for the design of more expressive and functionally robust metanetworks.
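The non-injectivity described above can be illustrated with a small, self-contained sketch. It uses two standard symmetries of a two-layer ReLU MLP: permuting hidden units, and positively rescaling a hidden unit while inversely rescaling its outgoing weights. These are chosen here purely for illustration and are an assumption, not necessarily the symmetry groups treated in the paper. All function names (`mlp`, `permute_hidden`, `scale_hidden`, `max_abs_diff`) are hypothetical helpers, not part of any published code.

```python
import random

def matvec(W, x):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]

def mlp(params, x):
    """Two-layer ReLU MLP: W2 @ relu(W1 @ x + b1) + b2."""
    W1, b1, W2, b2 = params
    h = [max(0.0, z + b) for z, b in zip(matvec(W1, x), b1)]
    return [z + b for z, b in zip(matvec(W2, h), b2)]

def permute_hidden(params, perm):
    """Reorder hidden units: rows of W1, entries of b1, columns of W2."""
    W1, b1, W2, b2 = params
    return ([W1[j] for j in perm],
            [b1[j] for j in perm],
            [[row[j] for j in perm] for row in W2],
            b2)

def scale_hidden(params, scales):
    """Scale hidden unit j by c_j > 0 and its outgoing weights by 1 / c_j.

    This preserves the function because relu(c * z) == c * relu(z) for c > 0.
    """
    W1, b1, W2, b2 = params
    return ([[c * w for w in row] for row, c in zip(W1, scales)],
            [c * b for b, c in zip(b1, scales)],
            [[w / c for w, c in zip(row, scales)] for row in W2],
            b2)

def max_abs_diff(a, b):
    """Largest absolute elementwise difference between two vectors."""
    return max(abs(u - v) for u, v in zip(a, b))

# Illustrative sizes and random weights (assumptions, not from the paper).
rng = random.Random(0)
n_in, n_hidden, n_out = 3, 4, 2
params = (
    [[rng.gauss(0, 1) for _ in range(n_in)] for _ in range(n_hidden)],
    [rng.gauss(0, 1) for _ in range(n_hidden)],
    [[rng.gauss(0, 1) for _ in range(n_hidden)] for _ in range(n_out)],
    [rng.gauss(0, 1) for _ in range(n_out)],
)
x = [rng.gauss(0, 1) for _ in range(n_in)]

permuted = permute_hidden(params, [2, 0, 3, 1])
scaled = scale_hidden(params, [0.5, 2.0, 3.0, 1.5])

# The parameters differ, but the outputs agree up to floating-point error.
print(max_abs_diff(mlp(params, x), mlp(permuted, x)))
print(max_abs_diff(mlp(params, x), mlp(scaled, x)))
```

All three parameter tuples describe the same function. A metanetwork that reads raw weights therefore sees three different inputs for one underlying network, which is the situation that equivariant designs are meant to handle.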

Viet-Hoang Tran, An Nguyen, Benoît Guérand, Thieu N. Vo, Tan M. Nguyen • 2026

Related benchmarks

Task | Dataset | Metric | Result | Rank
--- | --- | --- | --- | ---
Performance Prediction | Small CNN Zoo ReLU subset (test) | Kendall's Tau | 0.926 | 35
INR classification | F-MNIST Implicit Neural Representations (test) | Accuracy | 62.11 | 21
INR classification | CIFAR-10 (test) | Accuracy | 35.32 | 13
INR editing (dilate) | MNIST (test) | MSE | 0.066 | 13
INR classification | MNIST (test) | Accuracy | 70.21 | 13
Weight-space INR classification | CIFAR-10 (test) | Test Accuracy | 35.32 | 10
Predicting Transformer Generalization | MNIST-Transformers No threshold | Kendall's Tau | 0.911 | 8
Predicting Transformer Generalization | MNIST-Transformers 20% threshold | Kendall's Tau | 0.905 | 8
Predicting Transformer Generalization | MNIST-Transformers 40% threshold | Kendall's Tau | 0.898 | 8
Predicting Transformer Generalization | MNIST-Transformers (80% threshold) | Kendall's Tau | 0.892 | 8
Showing 10 of 19 rows
