Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

LLMatic: Neural Architecture Search via Large Language Models and Quality Diversity Optimization

About

Large Language Models (LLMs) have emerged as powerful tools capable of accomplishing a broad spectrum of tasks. Their abilities span numerous areas, and one area where they have made a significant impact is in the domain of code generation. Here, we propose using the coding abilities of LLMs to introduce meaningful variations to code defining neural networks. Meanwhile, Quality-Diversity (QD) algorithms are known to discover diverse and robust solutions. By merging the code-generating abilities of LLMs with the diversity and robustness of QD solutions, we introduce \texttt{LLMatic}, a Neural Architecture Search (NAS) algorithm. While LLMs struggle to conduct NAS directly through prompts, \texttt{LLMatic} uses a procedural approach, leveraging QD for prompts and network architecture to create diverse and high-performing networks. We test \texttt{LLMatic} on the CIFAR-10 and NAS-bench-201 benchmarks, demonstrating that it can produce competitive networks while evaluating just $2,000$ candidates, even without prior knowledge of the benchmark domain or exposure to any previous top-performing models for the benchmark. The open-sourced code is available in \url{https://github.com/umair-nasir14/LLMatic}.

Muhammad U. Nasir, Sam Earle, Christopher Cleghorn, Steven James, Julian Togelius• 2023

Related benchmarks

TaskDatasetResultRank
Image ClassificationCIFAR-10 NAS-Bench-201 (test)
Accuracy94.26
225
Image ClassificationCIFAR-100 NAS-Bench-201 (test)
Accuracy71.62
198
Image ClassificationImageNet-16-120 NAS-Bench-201 (test)
Accuracy45.87
167
Neural Architecture SearchNAS-Bench-201 ImageNet-16-120 (test)
Accuracy45.87
140
Neural Architecture SearchCIFAR-10 NAS-Bench-201 (val)
Accuracy91.42
111
Neural Architecture SearchNAS-Bench-201 CIFAR-10 (test)
Accuracy94.26
110
Neural Architecture SearchImageNet16-120 NAS-Bench-201 (val)
Accuracy44.98
104
Neural Architecture SearchNAS-Bench-201 CIFAR-100 (test)
Accuracy71.62
103
Neural Architecture SearchCIFAR-100 NAS-Bench-201 (val)
Accuracy71.41
92
Showing 9 of 9 rows

Other info

Follow for update