LABO: LLM-Accelerated Bayesian Optimization through Broad Exploration and Selective Experimentation

About

The high cost and data scarcity in scientific exploration have motivated the use of large language models (LLMs) as knowledge-driven components in Bayesian optimization (BO). However, existing approaches typically embed LLMs directly into the sampling or surrogate modeling pipeline, without fully leveraging their significantly lower evaluation cost compared to real-world experiments. To address this limitation, we propose LLM-Accelerated Bayesian Optimization (LABO), a framework that combines LLM predictions with experimental observations within a single BO loop. LABO employs a gating criterion to dynamically balance the reliance on LLM predictions versus actual experiments. By leveraging inexpensive LLM evaluations to broadly explore the search space and reserving costly real experiments only for regions with high uncertainty, LABO achieves more sample-efficient optimization. We provide a theoretical analysis with a cumulative regret bound that formalizes this efficiency gain. Empirical results across diverse scientific tasks demonstrate that LABO consistently outperforms existing methods under identical experimental budgets. Our results suggest that LABO offers a practical and theoretically grounded approach for integrating LLMs into scientific discovery workflows.
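The gating idea described above can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not state the actual gating criterion, so the code assumes a fixed threshold `tau` on a Gaussian-process posterior standard deviation as a placeholder. The toy objective `true_experiment`, the noisy stand-in `llm_predict`, the UCB weight `beta`, and the noise variances are all hypothetical choices made for illustration only.

```python
import numpy as np

def rbf_kernel(a, b, length_scale=0.2):
    """Squared-exponential kernel between two 1-D point sets."""
    diff = a[:, None] - b[None, :]
    return np.exp(-0.5 * (diff / length_scale) ** 2)

def gp_posterior(x_obs, y_obs, noise_var, x_query):
    """Zero-mean GP posterior mean and std, with one noise variance per point."""
    K = rbf_kernel(x_obs, x_obs) + np.diag(noise_var)
    K_s = rbf_kernel(x_obs, x_query)
    mean = K_s.T @ np.linalg.solve(K, y_obs)
    var = 1.0 - np.sum(K_s * np.linalg.solve(K, K_s), axis=0)
    return mean, np.sqrt(np.clip(var, 1e-12, None))

def true_experiment(x):
    """Toy stand-in for a costly real experiment (assumption)."""
    return float(np.sin(3.0 * x) + 0.5 * x)

def llm_predict(x, rng):
    """Hypothetical cheap LLM prediction: true value plus bias and noise."""
    return true_experiment(x) + 0.1 + 0.2 * rng.normal()

def gated_bo(n_iters=30, budget=8, tau=0.5, beta=2.0, seed=0):
    """Single BO loop mixing real experiments and LLM pseudo-observations."""
    rng = np.random.default_rng(seed)
    grid = np.linspace(0.0, 2.0, 101)
    # Start from one real observation, which counts toward the budget.
    x_obs = np.array([1.0])
    y_obs = np.array([true_experiment(1.0)])
    noise = np.array([1e-6])
    is_real = np.array([True])
    n_real = 1
    for _ in range(n_iters):
        mean, std = gp_posterior(x_obs, y_obs, noise, grid)
        idx = int(np.argmax(mean + beta * std))  # UCB acquisition
        x_next = grid[idx]
        # Assumed gate: spend a real experiment only where the surrogate
        # is still uncertain and experimental budget remains.
        real = bool(std[idx] > tau and n_real < budget)
        if real:
            y_next, nv = true_experiment(x_next), 1e-6
            n_real += 1
        else:
            # LLM predictions enter as noisier pseudo-observations.
            y_next, nv = llm_predict(x_next, rng), 0.05
        x_obs = np.append(x_obs, x_next)
        y_obs = np.append(y_obs, y_next)
        noise = np.append(noise, nv)
        is_real = np.append(is_real, real)
    return x_obs, y_obs, is_real
```

Calling `gated_bo()` returns the queried points, their recorded values, and a boolean mask marking which values came from the simulated real experiment. Giving LLM pseudo-observations a larger noise variance than real ones is one simple way to let the surrogate trust experiments more; the actual LABO treatment may differ.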

Zhuo Chen (equal contribution), Xinzhe Yuan (equal contribution), Jianshu Zhang, Jinzong Dong, Ruichen Zhou, Yingchun Niu, Tianhang Zhou, Yu Yang Fredrik Liu, Yuqiang Li, Nanyang Ye, Qinying Gu

(1) Shanghai Artificial Intelligence Laboratory, Shanghai, China; (2) School of Mechanical Engineering, Shanghai Jiao Tong University, Shanghai, China; (3) Institute for Advanced Study in Mathematics, Harbin Institute of Technology, Harbin, China; (4) School of Computer Science, Shanghai Jiao Tong University, Shanghai, China; (5) School of Automation, Central South University, Changsha, China; (6) College of New Energy, Materials, China University of Petroleum, Beijing, China; (7) College of Carbon Neutrality Future Technology, China University of Petroleum, Beijing, China; (8) DeepVerse PTE. LTD., Singapore

2026

Related benchmarks

Task	Dataset	Metric	Value
AutoML Hyperparameter Optimization	HPOBench SVM	Final Objective Value	90.9	5
AutoML Hyperparameter Optimization	HPOBench MLP	Final Objective Value	94.1	5
Bayesian Optimization	superconductor 86D	Final Objective Value	91.06	5
Discrete lipid nanoparticle formulation	CBD lipid nanoparticle formulation task	Final Objective Score	0.898	3
