Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

CoLLM-NAS: Collaborative Large Language Models for Efficient Knowledge-Guided Neural Architecture Search

About

The integration of Large Language Models (LLMs) with Neural Architecture Search (NAS) has introduced new possibilities for automating the design of neural architectures. However, most existing methods face critical limitations, including architectural invalidity, computational inefficiency, and inferior performance compared to traditional NAS. In this work, we present Collaborative LLM-based NAS (CoLLM-NAS), a two-stage NAS framework with knowledge-guided search driven by two complementary LLMs. Specifically, we propose a stateful Navigator LLM to guide search direction, a stateless Generator LLM to synthesize high-quality candidates, and a Coordinator module to orchestrate inter-LLM communication and manage evaluation processes. CoLLM-NAS efficiently guides the search process by combining LLMs' inherent knowledge of structured neural architectures with progressive knowledge from iterative feedback and historical trajectory. Experimental results on ImageNet and NAS-Bench-201 show that CoLLM-NAS surpasses existing NAS methods and conventional search algorithms, achieving new state-of-the-art results while significantly reducing search costs by 4--10. Furthermore, CoLLM-NAS consistently enhances the performance and efficiency of various two-stage NAS methods (e.g., OFA, SPOS, and AutoFormer) across diverse search spaces (e.g., MobileNet, ShuffleNet, and AutoFormer), demonstrating its excellent generalization.

Zhe Li, Zhiwei Lin, Yongtao Wang• 2025

Related benchmarks

TaskDatasetResultRank
Image ClassificationCIFAR-10 NAS-Bench-201 (test)
Accuracy94.37
225
Image ClassificationCIFAR-100 NAS-Bench-201 (test)
Accuracy73.44
198
Image ClassificationCIFAR-10 NAS-Bench-201 (val)
Accuracy91.59
169
Image ClassificationImageNet-16-120 NAS-Bench-201 (test)
Accuracy46.79
167
Neural Architecture SearchNAS-Bench-201 ImageNet-16-120 (test)
Accuracy46.79
140
Image ClassificationCIFAR-100 NAS-Bench-201 (val)
Accuracy73.44
139
Image ClassificationImageNet 16-120 NAS-Bench-201 (val)
Accuracy46.62
123
Neural Architecture SearchCIFAR-10 NAS-Bench-201 (val)
Accuracy91.59
111
Neural Architecture SearchNAS-Bench-201 CIFAR-10 (test)
Accuracy94.37
110
Neural Architecture SearchImageNet16-120 NAS-Bench-201 (val)
Accuracy46.62
104
Showing 10 of 12 rows

Other info

Follow for update