LLMatic: Neural Architecture Search via Large Language Models and Quality Diversity Optimization
About
Large Language Models (LLMs) have emerged as powerful tools capable of accomplishing a broad spectrum of tasks. Their abilities span numerous areas, and one area where they have made a significant impact is in the domain of code generation. Here, we propose using the coding abilities of LLMs to introduce meaningful variations to code defining neural networks. Meanwhile, Quality-Diversity (QD) algorithms are known to discover diverse and robust solutions. By merging the code-generating abilities of LLMs with the diversity and robustness of QD solutions, we introduce \texttt{LLMatic}, a Neural Architecture Search (NAS) algorithm. While LLMs struggle to conduct NAS directly through prompts, \texttt{LLMatic} uses a procedural approach, leveraging QD for prompts and network architecture to create diverse and high-performing networks. We test \texttt{LLMatic} on the CIFAR-10 and NAS-bench-201 benchmarks, demonstrating that it can produce competitive networks while evaluating just $2,000$ candidates, even without prior knowledge of the benchmark domain or exposure to any previous top-performing models for the benchmark. The open-sourced code is available in \url{https://github.com/umair-nasir14/LLMatic}.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Classification | CIFAR-10 NAS-Bench-201 (test) | Accuracy94.26 | 225 | |
| Image Classification | CIFAR-100 NAS-Bench-201 (test) | Accuracy71.62 | 198 | |
| Image Classification | ImageNet-16-120 NAS-Bench-201 (test) | Accuracy45.87 | 167 | |
| Neural Architecture Search | NAS-Bench-201 ImageNet-16-120 (test) | Accuracy45.87 | 140 | |
| Neural Architecture Search | CIFAR-10 NAS-Bench-201 (val) | Accuracy91.42 | 111 | |
| Neural Architecture Search | NAS-Bench-201 CIFAR-10 (test) | Accuracy94.26 | 110 | |
| Neural Architecture Search | ImageNet16-120 NAS-Bench-201 (val) | Accuracy44.98 | 104 | |
| Neural Architecture Search | NAS-Bench-201 CIFAR-100 (test) | Accuracy71.62 | 103 | |
| Neural Architecture Search | CIFAR-100 NAS-Bench-201 (val) | Accuracy71.41 | 92 |