Rethinking Neural Operations for Diverse Tasks
About
An important goal of AutoML is to automate-away the design of neural networks on new tasks in under-explored domains. Motivated by this goal, we study the problem of enabling users to discover the right neural operations given data from their specific domain. We introduce a search space of operations called XD-Operations that mimic the inductive bias of standard multi-channel convolutions while being much more expressive: we prove that it includes many named operations across multiple application areas. Starting with any standard backbone such as ResNet, we show how to transform it into a search space over XD-operations and how to traverse the space using a simple weight-sharing scheme. On a diverse set of tasks -- solving PDEs, distance prediction for protein folding, and music modeling -- our approach consistently yields models with lower error than baseline networks and often even lower error than expert-designed domain-specific approaches.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Language Modeling | Penn Treebank (test) | -- | 411 | |
| PDE solving | 1d Burgers' equation (test) | Relative Error0.0079 | 85 | |
| 2d Darcy Flow | 2d Darcy Flow (test) | Test Relative Error0.65 | 51 | |
| Protein residue distance prediction | PSICOV (test) | MAE84 | 15 | |
| Music Modeling | JSB Chorales (test) | -- | 11 | |
| Music Modeling | Nottingham | Average Loss2.84 | 7 | |
| Sequence Modeling | Permuted MNIST (test) | -- | 6 | |
| Music Modeling | JSB Chorales | Average Loss8.07 | 5 | |
| Music Modeling | Nottingham (test) | -- | 5 | |
| Solving 2D Navier-Stokes equations | 2D Navier-Stokes nu = 10^-4, T = 30 | Relative Test Error0.172 | 3 |