Design Editing for Offline Model-based Optimization

About

Offline model-based optimization (MBO) aims to maximize a black-box objective function using only an offline dataset of designs and scores. These tasks span various domains, such as robotics, material design, and protein and molecular engineering. A common approach involves training a surrogate model using existing designs and their corresponding scores, and then generating new designs through gradient-based updates with respect to the surrogate model. This method suffers from the out-of-distribution issue, where the surrogate model may erroneously predict high scores for unseen designs. To address this challenge, we introduce a novel method, Design Editing for Offline Model-based Optimization (DEMO), which leverages a diffusion prior to calibrate overly optimized designs. DEMO first generates pseudo design candidates by performing gradient ascent with respect to a surrogate model. While these pseudo design candidates contain information beyond the offline dataset, they might be invalid or have erroneously high predicted scores. Therefore, to address this challenge while utilizing the information provided by pseudo design candidates, we propose an editing process to refine these pseudo design candidates. We introduce noise to the pseudo design candidates and subsequently denoise them with a diffusion prior trained on the offline dataset, ensuring they align with the distribution of valid designs. Empirical evaluations on seven offline MBO tasks show that, with properly tuned hyperparameters, DEMOs score is competitive with the best previously reported scores in the literature.

Ye Yuan, Youyuan Zhang, Can Chen, Haolun Wu, Zixuan Li, Jianmo Li, James J. Clark, Xue Liu• 2024

Related benchmarks

Task	Dataset	Result
Offline Black-box Optimization	TF8	Normalized Median Score61.7	25
Offline Black-box Optimization	LLM-DM	Normalized Median Score90.6	25
Offline Black-box Optimization	TF10	Normalized Median Score0.522	25
Offline Black-box Optimization	D'Kitty	Normalized Median Score0.891	25
Offline Black-box Optimization	SuperC	Normalized Median Score40	25
Offline Black-box Optimization	Ant	Normalized Median Score0.604	25
Offline Black-box Optimization	Overall Task Suite SuperC, Ant, D’Kitty, LLM-DM, TF8, TF10	Mean Rank5.5	24
Locomotion Control	Ant 1% offline data	Optimization Performance Score80	11
Locomotion Control	D'Kitty 1% offline data	Optimization Performance Score0.845	11
Sequence design	RNA3 1% offline data	Optimization Performance Score29.5	11

Showing 10 of 13 rows

Other info

Follow for update

@wizwand_team Discord