Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Design Editing for Offline Model-based Optimization

About

Offline model-based optimization (MBO) aims to maximize a black-box objective function using only an offline dataset of designs and scores. These tasks span various domains, such as robotics, material design, and protein and molecular engineering. A common approach involves training a surrogate model using existing designs and their corresponding scores, and then generating new designs through gradient-based updates with respect to the surrogate model. This method suffers from the out-of-distribution issue, where the surrogate model may erroneously predict high scores for unseen designs. To address this challenge, we introduce a novel method, Design Editing for Offline Model-based Optimization (DEMO), which leverages a diffusion prior to calibrate overly optimized designs. DEMO first generates pseudo design candidates by performing gradient ascent with respect to a surrogate model. While these pseudo design candidates contain information beyond the offline dataset, they might be invalid or have erroneously high predicted scores. Therefore, to address this challenge while utilizing the information provided by pseudo design candidates, we propose an editing process to refine these pseudo design candidates. We introduce noise to the pseudo design candidates and subsequently denoise them with a diffusion prior trained on the offline dataset, ensuring they align with the distribution of valid designs. Empirical evaluations on seven offline MBO tasks show that, with properly tuned hyperparameters, DEMOs score is competitive with the best previously reported scores in the literature.

Ye Yuan, Youyuan Zhang, Can Chen, Haolun Wu, Zixuan Li, Jianmo Li, James J. Clark, Xue Liu• 2024

Related benchmarks

TaskDatasetResultRank
Offline Black-box OptimizationTF8
Normalized Median Score61.7
25
Offline Black-box OptimizationLLM-DM
Normalized Median Score90.6
25
Offline Black-box OptimizationTF10
Normalized Median Score0.522
25
Offline Black-box OptimizationD'Kitty
Normalized Median Score0.891
25
Offline Black-box OptimizationSuperC
Normalized Median Score40
25
Offline Black-box OptimizationAnt
Normalized Median Score0.604
25
Offline Black-box OptimizationOverall Task Suite SuperC, Ant, D’Kitty, LLM-DM, TF8, TF10
Mean Rank5.5
24
Locomotion ControlAnt 1% offline data
Optimization Performance Score80
11
Locomotion ControlD'Kitty 1% offline data
Optimization Performance Score0.845
11
Sequence designRNA3 1% offline data
Optimization Performance Score29.5
11
Showing 10 of 13 rows

Other info

Follow for update