ParZC: Parametric Zero-Cost Proxies for Efficient NAS
About
Recent advancements in Zero-shot Neural Architecture Search (NAS) highlight the efficacy of zero-cost proxies in various NAS benchmarks. Several studies propose the automated design of zero-cost proxies to achieve SOTA performance but require tedious searching progress. Furthermore, we identify a critical issue with current zero-cost proxies: they aggregate node-wise zero-cost statistics without considering the fact that not all nodes in a neural network equally impact performance estimation. Our observations reveal that node-wise zero-cost statistics significantly vary in their contributions to performance, with each node exhibiting a degree of uncertainty. Based on this insight, we introduce a novel method called Parametric Zero-Cost Proxies (ParZC) framework to enhance the adaptability of zero-cost proxies through parameterization. To address the node indiscrimination, we propose a Mixer Architecture with Bayesian Network (MABN) to explore the node-wise zero-cost statistics and estimate node-specific uncertainty. Moreover, we propose DiffKendall as a loss function to directly optimize Kendall's Tau coefficient in a differentiable manner so that our ParZC can better handle the discrepancies in ranking architectures. Comprehensive experiments on NAS-Bench-101, 201, and NDS demonstrate the superiority of our proposed ParZC compared to existing zero-shot NAS methods. Additionally, we demonstrate the versatility and adaptability of ParZC by transferring it to the Vision Transformer search space.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Classification | ImageNet 1k (test) | Top-1 Accuracy75.5 | 798 | |
| Image Classification | CIFAR-10 NAS-Bench-201 (test) | Accuracy94.36 | 173 | |
| Image Classification | CIFAR-100 NAS-Bench-201 (test) | Accuracy73.51 | 169 | |
| Image Classification | ImageNet-16-120 NAS-Bench-201 (test) | Accuracy46.34 | 139 | |
| Image Classification | CIFAR-10 NAS-Bench-201 (val) | Accuracy91.55 | 119 | |
| Image Classification | CIFAR-100 NAS-Bench-201 (val) | Accuracy73.49 | 109 | |
| Image Classification | ImageNet 16-120 NAS-Bench-201 (val) | Accuracy46.37 | 96 | |
| Neural Architecture Search | NAS-Bench-101 CIFAR-10 | Spearman Correlation0.832 | 13 | |
| Neural Architecture Search | NAS-Bench-201 CIFAR-10 | Spearman Correlation90.4 | 13 | |
| Neural Architecture Search | NAS-Bench-201 CIFAR-100 | Spearman Correlation0.911 | 13 |