Generative Pretraining for Black-Box Optimization
About
Many problems in science and engineering involve optimizing an expensive black-box function over a high-dimensional space. For such black-box optimization (BBO) problems, we typically assume a small budget for online function evaluations, but also often have access to a fixed, offline dataset for pretraining. Prior approaches seek to utilize the offline data to approximate the function or its inverse but are not sufficiently accurate far from the data distribution. We propose BONET, a generative framework for pretraining a novel black-box optimizer using offline datasets. In BONET, we train an autoregressive model on fixed-length trajectories derived from an offline dataset. We design a sampling strategy to synthesize trajectories from offline data using a simple heuristic of rolling out monotonic transitions from low-fidelity to high-fidelity samples. Empirically, we instantiate BONET using a causally masked Transformer and evaluate it on Design-Bench, where we rank the best on average, outperforming state-of-the-art baselines.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Offline Black-box Optimization | Ant | Normalized Median Score0.819 | 25 | |
| Offline Black-box Optimization | D'Kitty | Normalized Median Score0.907 | 25 | |
| Offline Black-box Optimization | TF10 | Normalized Median Score0.496 | 25 | |
| Offline Black-box Optimization | LLM-DM | Normalized Median Score87 | 25 | |
| Offline Black-box Optimization | TF8 | Normalized Median Score50.5 | 25 | |
| Offline Black-box Optimization | SuperC | Normalized Median Score36.9 | 25 | |
| Offline Black-box Optimization | Overall Task Suite SuperC, Ant, D’Kitty, LLM-DM, TF8, TF10 | Mean Rank9.3 | 24 | |
| Offline Black-box Optimization | Design-bench 100-th percentile | TFBIND8 Score91.1 | 20 | |
| Offline Model-Based Optimization | UTR | 90th Percentile Oracle Score8.7 | 17 | |
| Offline Model-Based Optimization | GFP | 90th Percentile Oracle Score3.74 | 17 |