Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

LABO: LLM-Accelerated Bayesian Optimization through Broad Exploration and Selective Experimentation

About

The high cost and data scarcity in scientific exploration have motivated the use of large language models (LLMs) as knowledge-driven components in Bayesian optimization (BO). However, existing approaches typically embed LLMs directly into the sampling or surrogate modeling pipeline, without fully leveraging their significantly lower evaluation cost compared to real-world experiments. To address this limitation, we propose LLM-Accelerated Bayesian Optimization (LABO), a framework that combines LLM predictions with experimental observations within a single BO loop. LABO employs a gating criterion to dynamically balance the reliance on LLM predictions versus actual experiments. By leveraging inexpensive LLM evaluations to broadly explore the search space and reserving costly real experiments only for regions with high uncertainty, LABO achieves more sample-efficient optimization. We provide a theoretical analysis with a cumulative regret bound that formalizes this efficiency gain. Empirical results across diverse scientific tasks demonstrate that LABO consistently outperforms existing methods under identical experimental budgets. Our results suggest that LABO offers a practical and theoretically grounded approach for integrating LLMs into scientific discovery workflows.

Zhuo Chen (equal contribution) __INSTITUTION_1__, Xinzhe Yuan (equal contribution) __INSTITUTION_3__, Jianshu Zhang, Jinzong Dong, Ruichen Zhou, Yingchun Niu, Tianhang Zhou, Yu Yang Fredrik Liu, Yuqiang Li, Nanyang Ye, Qinying Gu (1) __INSTITUTION_13__ Shanghai Artificial Intelligence Laboratory, Shanghai, China, (2) School of Mechanical Engineering, Shanghai Jiao Tong University, Shanghai, China, (3) Institute for Advanced Study in Mathematics, Harbin Institute of Technology, Harbin, China, (4) School of Computer Science, Shanghai Jiao Tong University, Shanghai, China, (5) School of Automation, Central South University, Changsha, China, (6) College of New Energy, Materials, China University of Petroleum, Beijing, China, (7) College of Carbon Neutrality Future Technology, China University of Petroleum, Beijing, China, (8) DeepVerse PTE. LTD., Singapore)• 2026

Related benchmarks

TaskDatasetResultRank
AutoML Hyperparameter OptimizationHPOBench SVM
Final Objective Value90.9
5
AutoML Hyperparameter OptimizationHPOBench MLP
Final Objective Value94.1
5
Bayesian Optimizationsuperconductor 86D
Final Objective Value91.06
5
Discrete lipid nanoparticle formulationCBD lipid nanoparticle formulation task
Final Objective Score0.898
3
Showing 4 of 4 rows

Other info

Follow for update