Guided Trajectory Generation with Diffusion Models for Offline Model-based Optimization
About
Optimizing complex and high-dimensional black-box functions is ubiquitous in science and engineering fields. Unfortunately, the online evaluation of these functions is restricted due to time and safety constraints in most cases. In offline model-based optimization (MBO), we aim to find a design that maximizes the target function using only a pre-existing offline dataset. While prior methods consider forward or inverse approaches to address the problem, these approaches are limited by conservatism and the difficulty of learning highly multi-modal mappings. Recently, there has been an emerging paradigm of learning to improve solutions with synthetic trajectories constructed from the offline dataset. In this paper, we introduce a novel conditional generative modeling approach to produce trajectories toward high-scoring regions. First, we construct synthetic trajectories toward high-scoring regions using the dataset while injecting locality bias for consistent improvement directions. Then, we train a conditional diffusion model to generate trajectories conditioned on their scores. Lastly, we sample multiple trajectories from the trained model with guidance to explore high-scoring regions beyond the dataset and select high-fidelity designs among generated trajectories with the proxy function. Extensive experiment results demonstrate that our method outperforms competitive baselines on Design-Bench and its practical variants. The code is publicly available in \texttt{https://github.com/dbsxodud-11/GTG}.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Offline Model-Based Optimization | D'Kitty Morphology Design-Bench | 100th Percentile Score94.2 | 23 | |
| Offline Model-Based Optimization | Ant Morphology Design-Bench | 100th Percentile Score0.855 | 23 | |
| Offline Model-Based Optimization | Superconductor Design-Bench | Score (P100)48 | 22 | |
| Offline Optimization | D'Kitty 1% offline data | Score91.3 | 8 | |
| Offline Optimization | TFBind8 1% offline data | Score0.89 | 8 | |
| Offline Model-Based Optimization | Design-Bench TF-Bind-8 | 100th Percentile Score91 | 8 | |
| Offline Optimization | Ant 1% offline data | Score82.5 | 8 | |
| Offline Model-Based Optimization | Design-Bench TF-Bind-10 | 100th Percentile Normalized Score0.619 | 8 | |
| Offline Model-Based Optimization | Design-Bench Aggregate | Average Rank5.6 | 7 |