Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Learning Probabilistic Symmetrization for Architecture Agnostic Equivariance

About

We present a novel framework to overcome the limitations of equivariant architectures in learning functions with group symmetries. In contrary to equivariant architectures, we use an arbitrary base model such as an MLP or a transformer and symmetrize it to be equivariant to the given group by employing a small equivariant network that parameterizes the probabilistic distribution underlying the symmetrization. The distribution is end-to-end trained with the base model which can maximize performance while reducing sample complexity of symmetrization. We show that this approach ensures not only equivariance to given group but also universal approximation capability in expectation. We implement our method on various base models, including patch-based transformers that can be initialized from pretrained vision transformers, and test them for a wide range of symmetry groups including permutation and Euclidean groups and their combinations. Empirical tests show competitive results against tailored equivariant architectures, suggesting the potential for learning equivariant functions for diverse groups using a non-equivariant universal base architecture. We further show evidence of enhanced learning in symmetric modalities, like graphs, when pretrained from non-symmetric modalities, like vision. Code is available at https://github.com/jw9730/lps.

Jinwoo Kim, Tien Dat Nguyen, Ayhan Suleymanzade, Hyeokjun An, Seunghoon Hong• 2023

Related benchmarks

TaskDatasetResultRank
Node ClassificationPATTERN (test)
Test Accuracy86.65
88
Graph ClassificationEXP (test)
Accuracy100
33
Link PredictionPCQM-Contact (test)
MRR0.5341
11
Multi-Label ClassificationPeptides-func (test)
AP0.6575
11
RegressionPeptides struct (test)
MAE0.2559
11
Graph SeparationGRAPH8c random initialization
Non-Separated Pairs0.00e+0
11
Graph SeparationEXP random initialization
Non-separated Graph Pairs0.00e+0
11
Position Regressionn-body (test)
Position MSE0.004
9
Showing 8 of 8 rows

Other info

Code

Follow for update