Guided Trajectory Generation with Diffusion Models for Offline Model-based Optimization
About
Optimizing complex and high-dimensional black-box functions is ubiquitous in science and engineering fields. Unfortunately, the online evaluation of these functions is restricted due to time and safety constraints in most cases. In offline model-based optimization (MBO), we aim to find a design that maximizes the target function using only a pre-existing offline dataset. While prior methods consider forward or inverse approaches to address the problem, these approaches are limited by conservatism and the difficulty of learning highly multi-modal mappings. Recently, there has been an emerging paradigm of learning to improve solutions with synthetic trajectories constructed from the offline dataset. In this paper, we introduce a novel conditional generative modeling approach to produce trajectories toward high-scoring regions. First, we construct synthetic trajectories toward high-scoring regions using the dataset while injecting locality bias for consistent improvement directions. Then, we train a conditional diffusion model to generate trajectories conditioned on their scores. Lastly, we sample multiple trajectories from the trained model with guidance to explore high-scoring regions beyond the dataset and select high-fidelity designs among generated trajectories with the proxy function. Extensive experiment results demonstrate that our method outperforms competitive baselines on Design-Bench and its practical variants. The code is publicly available in \texttt{https://github.com/dbsxodud-11/GTG}.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Offline Black-box Optimization | Ant | Normalized Median Score0.645 | 25 | |
| Offline Black-box Optimization | D'Kitty | Normalized Median Score0.901 | 25 | |
| Offline Black-box Optimization | LLM-DM | Normalized Median Score89.5 | 25 | |
| Offline Black-box Optimization | SuperC | Normalized Median Score38 | 25 | |
| Offline Black-box Optimization | TF8 | Normalized Median Score46 | 25 | |
| Offline Black-box Optimization | TF10 | Normalized Median Score0.452 | 25 | |
| Offline Black-box Optimization | Overall Task Suite SuperC, Ant, D’Kitty, LLM-DM, TF8, TF10 | Mean Rank11.5 | 24 | |
| Offline Model-Based Optimization | D'Kitty Morphology Design-Bench | 100th Percentile Score94.2 | 23 | |
| Offline Model-Based Optimization | Ant Morphology Design-Bench | 100th Percentile Score0.855 | 23 | |
| Offline Model-Based Optimization | Superconductor Design-Bench | Score (P100)48 | 22 |