Adaptive Kernel Design for Bayesian Optimization Is a Piece of CAKE with LLMs

About

The efficiency of Bayesian optimization (BO) relies heavily on the choice of the Gaussian process (GP) kernel, which plays a central role in balancing exploration and exploitation under limited evaluation budgets. Traditional BO methods often rely on fixed or heuristic kernel selection strategies, which can result in slow convergence or suboptimal solutions when the chosen kernel is poorly suited to the underlying objective function. To address this limitation, we propose a freshly-baked Context-Aware Kernel Evolution (CAKE) to enhance BO with large language models (LLMs). Concretely, CAKE leverages LLMs as the crossover and mutation operators to adaptively generate and refine GP kernels based on the observed data throughout the optimization process. To maximize the power of CAKE, we further propose BIC-Acquisition Kernel Ranking (BAKER) to select the most effective kernel through balancing the model fit measured by the Bayesian information criterion (BIC) with the expected improvement at each iteration of BO. Extensive experiments demonstrate that our fresh CAKE-based BO method consistently outperforms established baselines across a range of real-world tasks, including hyperparameter optimization, controller tuning, and photonic chip design. Our code is publicly available at https://github.com/richardcsuwandi/cake.
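The ranking idea described above, weighing model fit (BIC) against expected improvement, can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say how BAKER combines the two quantities, so the weighting used here (normalized exp(-BIC/2) multiplied by the best expected improvement over a candidate pool) is an assumption made for illustration. The helper names (`gp_fit`, `rank_kernels`), the two fixed candidate kernels, and the fixed hyperparameters are also hypothetical. In the paper, the kernels are generated by an LLM and their hyperparameters would normally be fitted.

```python
import math
import numpy as np

def rbf_kernel(X1, X2, lengthscale=1.0):
    # Squared-exponential kernel with a fixed (not fitted) lengthscale.
    d2 = ((X1[:, None, :] - X2[None, :, :]) ** 2).sum(-1)
    return np.exp(-0.5 * d2 / lengthscale**2)

def linear_kernel(X1, X2, variance=1.0):
    # Linear (dot-product) kernel with a fixed variance.
    return variance * X1 @ X2.T

def gp_fit(kernel, X, y, noise=1e-2):
    # Zero-mean GP: Cholesky factor, K^{-1} y, and log marginal likelihood.
    n = len(X)
    K = kernel(X, X) + noise * np.eye(n)
    L = np.linalg.cholesky(K)
    alpha = np.linalg.solve(L.T, np.linalg.solve(L, y))
    log_ml = -0.5 * y @ alpha - np.log(np.diag(L)).sum() - 0.5 * n * math.log(2 * math.pi)
    return L, alpha, log_ml

def bic(log_ml, n_params, n_obs):
    # Bayesian information criterion: k * ln(n) - 2 * ln(L); lower is better.
    return n_params * math.log(n_obs) - 2.0 * log_ml

def expected_improvement(kernel, X, L, alpha, x_new, best_y):
    # Closed-form EI for maximization at a single point x_new.
    k_star = kernel(X, x_new[None, :])[:, 0]
    mu = k_star @ alpha
    v = np.linalg.solve(L, k_star)
    var = kernel(x_new[None, :], x_new[None, :])[0, 0] - v @ v
    sigma = math.sqrt(max(var, 1e-12))
    z = (mu - best_y) / sigma
    cdf = 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))
    pdf = math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)
    return max((mu - best_y) * cdf + sigma * pdf, 0.0)

def rank_kernels(candidates, X, y, x_pool):
    # ASSUMPTION: score = normalized exp(-BIC/2) weight * best EI over x_pool.
    # The paper's actual combination rule may differ.
    best_y = y.max()
    rows = []
    for name, (kernel, n_params) in candidates.items():
        L, alpha, log_ml = gp_fit(kernel, X, y)
        ei = max(expected_improvement(kernel, X, L, alpha, x, best_y) for x in x_pool)
        rows.append((name, bic(log_ml, n_params, len(X)), ei))
    min_bic = min(b for _, b, _ in rows)
    weights = np.array([math.exp(-0.5 * (b - min_bic)) for _, b, _ in rows])
    weights /= weights.sum()
    scored = [(name, float(w * ei)) for (name, _, ei), w in zip(rows, weights)]
    return sorted(scored, key=lambda t: t[1], reverse=True)

# Toy data: five observations of a 1-D objective, plus a pool of query points.
X = np.linspace(0.0, 1.0, 5).reshape(-1, 1)
y = np.sin(3.0 * X[:, 0])
x_pool = np.linspace(0.0, 1.0, 21).reshape(-1, 1)
candidates = {"rbf": (rbf_kernel, 1), "linear": (linear_kernel, 1)}
ranking = rank_kernels(candidates, X, y, x_pool)
```

Here `ranking` is a list of `(kernel_name, score)` pairs sorted from best to worst under the assumed scoring rule. Subtracting the minimum BIC before exponentiating keeps the weights numerically stable without changing their normalized values.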

Richard Cornelius Suwandi, Feng Yin, Juntao Wang, Renjie Li, Tsung-Hui Chang, Sergios Theodoridis • 2025

Related benchmarks

Task                                    Dataset             Result                       Rank
High-Dimensional Bayesian Optimization  Mopta08 d = 124     Rank: 9.8                    22
High-Dimensional Bayesian Optimization  Humanoid d = 6392   Rank: 9.8                    21
High-Dimensional Bayesian Optimization  SVM D = 388         Objective Value: 0.131       17
High-Dimensional Bayesian Optimization  Rover D = 100       Objective Value: 3.412       17
AutoML Hyperparameter Optimization      HPOBench SVM        Final Objective Value: 90.4  5
AutoML Hyperparameter Optimization      HPOBench MLP        Final Objective Value: 93.9  5
Bayesian Optimization                   superconductor 86D  Final Objective Value: 76.57 5