Bridging the Gap between Sample-based and One-shot Neural Architecture Search with BONAS
About
Neural Architecture Search (NAS) has shown great potentials in finding better neural network designs. Sample-based NAS is the most reliable approach which aims at exploring the search space and evaluating the most promising architectures. However, it is computationally very costly. As a remedy, the one-shot approach has emerged as a popular technique for accelerating NAS using weight-sharing. However, due to the weight-sharing of vastly different networks, the one-shot approach is less reliable than the sample-based approach. In this work, we propose BONAS (Bayesian Optimized Neural Architecture Search), a sample-based NAS framework which is accelerated using weight-sharing to evaluate multiple related architectures simultaneously. Specifically, we apply Graph Convolutional Network predictor as a surrogate model for Bayesian Optimization to select multiple related candidate models in each iteration. We then apply weight-sharing to train multiple candidate models simultaneously. This approach not only accelerates the traditional sample-based approach significantly, but also keeps its reliability. This is because weight-sharing among related architectures are more reliable than those in the one-shot approach. Extensive experiments are conducted to verify the effectiveness of our method over many competing algorithms.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Classification | ImageNet Mobile Setting (test) | Top-1 Error24.6 | 165 | |
| Architecture Performance Prediction | NAS-Bench-101 (test) | Kendall's Tau0.8952 | 40 | |
| Neural Architecture Search | NAS-Bench-101 1.0 (test) | -- | 22 | |
| Neural Architecture Search (Performance Prediction) | NAS-Bench-201 (test) | Kendall's Tau0.43 | 18 | |
| Image Classification | CIFAR-10 DARTS search space (test) | Best Test Error2.43 | 14 | |
| Multi-task Neural Architecture Search | TransNAS-Bench-101 Macro level search space 1.0 | Cls O Acc47.06 | 14 | |
| Autoencoder | TransNAS-Bench-101 Micro level search space | SSIM56.73 | 13 | |
| Surface Normal Prediction | TransNAS-Bench-101 Micro level search space | SSIM57.46 | 13 | |
| Object Classification | TransNAS-Bench-101 Micro level search space | Accuracy45.5 | 13 | |
| Room Layout Reconstruction | TransNAS-Bench-101 Micro level search space | L2 Loss60.93 | 13 |