Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Guided Trajectory Generation with Diffusion Models for Offline Model-based Optimization

About

Optimizing complex and high-dimensional black-box functions is ubiquitous in science and engineering fields. Unfortunately, the online evaluation of these functions is restricted due to time and safety constraints in most cases. In offline model-based optimization (MBO), we aim to find a design that maximizes the target function using only a pre-existing offline dataset. While prior methods consider forward or inverse approaches to address the problem, these approaches are limited by conservatism and the difficulty of learning highly multi-modal mappings. Recently, there has been an emerging paradigm of learning to improve solutions with synthetic trajectories constructed from the offline dataset. In this paper, we introduce a novel conditional generative modeling approach to produce trajectories toward high-scoring regions. First, we construct synthetic trajectories toward high-scoring regions using the dataset while injecting locality bias for consistent improvement directions. Then, we train a conditional diffusion model to generate trajectories conditioned on their scores. Lastly, we sample multiple trajectories from the trained model with guidance to explore high-scoring regions beyond the dataset and select high-fidelity designs among generated trajectories with the proxy function. Extensive experiment results demonstrate that our method outperforms competitive baselines on Design-Bench and its practical variants. The code is publicly available in \texttt{https://github.com/dbsxodud-11/GTG}.

Taeyoung Yun, Sujin Yun, Jaewoo Lee, Jinkyoo Park• 2024

Related benchmarks

TaskDatasetResultRank
Offline Model-Based OptimizationD'Kitty Morphology Design-Bench
100th Percentile Score94.2
23
Offline Model-Based OptimizationAnt Morphology Design-Bench
100th Percentile Score0.855
23
Offline Model-Based OptimizationSuperconductor Design-Bench
Score (P100)48
22
Offline OptimizationD'Kitty 1% offline data
Score91.3
8
Offline OptimizationTFBind8 1% offline data
Score0.89
8
Offline Model-Based OptimizationDesign-Bench TF-Bind-8
100th Percentile Score91
8
Offline OptimizationAnt 1% offline data
Score82.5
8
Offline Model-Based OptimizationDesign-Bench TF-Bind-10
100th Percentile Normalized Score0.619
8
Offline Model-Based OptimizationDesign-Bench Aggregate
Average Rank5.6
7
Showing 9 of 9 rows

Other info

Follow for update