Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Design Editing for Offline Model-based Optimization

About

Offline model-based optimization (MBO) aims to maximize a black-box objective function using only an offline dataset of designs and scores. These tasks span various domains, such as robotics, material design, and protein and molecular engineering. A common approach involves training a surrogate model using existing designs and their corresponding scores, and then generating new designs through gradient-based updates with respect to the surrogate model. This method suffers from the out-of-distribution issue, where the surrogate model may erroneously predict high scores for unseen designs. To address this challenge, we introduce a novel method, Design Editing for Offline Model-based Optimization (DEMO), which leverages a diffusion prior to calibrate overly optimized designs. DEMO first generates pseudo design candidates by performing gradient ascent with respect to a surrogate model. While these pseudo design candidates contain information beyond the offline dataset, they might be invalid or have erroneously high predicted scores. Therefore, to address this challenge while utilizing the information provided by pseudo design candidates, we propose an editing process to refine these pseudo design candidates. We introduce noise to the pseudo design candidates and subsequently denoise them with a diffusion prior trained on the offline dataset, ensuring they align with the distribution of valid designs. Empirical evaluations on seven offline MBO tasks show that, with properly tuned hyperparameters, DEMOs score is competitive with the best previously reported scores in the literature.

Ye Yuan, Youyuan Zhang, Can Chen, Haolun Wu, Zixuan Li, Jianmo Li, James J. Clark, Xue Liu• 2024

Related benchmarks

TaskDatasetResultRank
Locomotion ControlAnt 1% offline data
Optimization Performance Score80
11
Locomotion ControlD'Kitty 1% offline data
Optimization Performance Score0.845
11
Sequence designRNA3 1% offline data
Optimization Performance Score29.5
11
Protein-DNA binding affinity predictionTFBind8 1% offline data
Optimization Performance Score0.677
11
Sequence designRNA1 1% offline data
Optimization Performance Score16.6
11
Sequence designRNA2 1% offline data
Optimization Performance Score0.141
11
Showing 6 of 6 rows

Other info

Follow for update