Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Rethinking Neural Operations for Diverse Tasks

About

An important goal of AutoML is to automate-away the design of neural networks on new tasks in under-explored domains. Motivated by this goal, we study the problem of enabling users to discover the right neural operations given data from their specific domain. We introduce a search space of operations called XD-Operations that mimic the inductive bias of standard multi-channel convolutions while being much more expressive: we prove that it includes many named operations across multiple application areas. Starting with any standard backbone such as ResNet, we show how to transform it into a search space over XD-operations and how to traverse the space using a simple weight-sharing scheme. On a diverse set of tasks -- solving PDEs, distance prediction for protein folding, and music modeling -- our approach consistently yields models with lower error than baseline networks and often even lower error than expert-designed domain-specific approaches.

Nicholas Roberts, Mikhail Khodak, Tri Dao, Liam Li, Christopher R\'e, Ameet Talwalkar• 2021

Related benchmarks

TaskDatasetResultRank
Language ModelingPenn Treebank (test)--
411
PDE solving1d Burgers' equation (test)
Relative Error0.0079
85
2d Darcy Flow2d Darcy Flow (test)
Test Relative Error0.65
51
Protein residue distance predictionPSICOV (test)
MAE84
15
Music ModelingJSB Chorales (test)--
11
Music ModelingNottingham
Average Loss2.84
7
Sequence ModelingPermuted MNIST (test)--
6
Music ModelingJSB Chorales
Average Loss8.07
5
Music ModelingNottingham (test)--
5
Solving 2D Navier-Stokes equations2D Navier-Stokes nu = 10^-4, T = 30
Relative Test Error0.172
3
Showing 10 of 11 rows

Other info

Code

Follow for update