Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Bidirectional Learning for Offline Infinite-width Model-based Optimization

About

In offline model-based optimization, we strive to maximize a black-box objective function by only leveraging a static dataset of designs and their scores. This problem setting arises in numerous fields including the design of materials, robots, DNA sequences, and proteins. Recent approaches train a deep neural network (DNN) on the static dataset to act as a proxy function, and then perform gradient ascent on the existing designs to obtain potentially high-scoring designs. This methodology frequently suffers from the out-of-distribution problem where the proxy function often returns poor designs. To mitigate this problem, we propose BiDirectional learning for offline Infinite-width model-based optimization (BDI). BDI consists of two mappings: the forward mapping leverages the static dataset to predict the scores of the high-scoring designs, and the backward mapping leverages the high-scoring designs to predict the scores of the static dataset. The backward mapping, neglected in previous work, can distill more information from the static dataset into the high-scoring designs, which effectively mitigates the out-of-distribution problem. For a finite-width DNN model, the loss function of the backward mapping is intractable and only has an approximate form, which leads to a significant deterioration of the design quality. We thus adopt an infinite-width DNN model, and propose to employ the corresponding neural tangent kernel to yield a closed-form loss for more accurate design updates. Experiments on various tasks verify the effectiveness of BDI. The code is available at https://github.com/GGchen1997/BDI.

Can Chen, Yingxue Zhang, Jie Fu, Xue Liu, Mark Coates• 2022

Related benchmarks

TaskDatasetResultRank
Offline Black-box OptimizationLLM-DM
Normalized Median Score91.2
25
Offline Black-box OptimizationSuperC
Normalized Median Score41.2
25
Offline Black-box OptimizationTF10
Normalized Median Score0.476
25
Offline Black-box OptimizationD'Kitty
Normalized Median Score0.855
25
Offline Black-box OptimizationAnt
Normalized Median Score0.474
25
Offline Black-box OptimizationTF8
Normalized Median Score43.9
25
Offline Black-box OptimizationOverall Task Suite SuperC, Ant, D’Kitty, LLM-DM, TF8, TF10
Mean Rank12.8
24
Offline Black-box OptimizationDesign-bench 100-th percentile
TFBIND8 Score87
20
Offline Model-Based OptimizationD'Kitty
Oracle Score (90th Pctl)0.62
17
Offline Model-Based OptimizationGFP
90th Percentile Oracle Score3.62
17
Showing 10 of 26 rows

Other info

Follow for update