Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Conservative Objective Models for Effective Offline Model-Based Optimization

About

Computational design problems arise in a number of settings, from synthetic biology to computer architectures. In this paper, we aim to solve data-driven model-based optimization (MBO) problems, where the goal is to find a design input that maximizes an unknown objective function provided access to only a static dataset of prior experiments. Such data-driven optimization procedures are the only practical methods in many real-world domains where active data collection is expensive (e.g., when optimizing over proteins) or dangerous (e.g., when optimizing over aircraft designs). Typical methods for MBO that optimize the design against a learned model suffer from distributional shift: it is easy to find a design that "fools" the model into predicting a high value. To overcome this, we propose conservative objective models (COMs), a method that learns a model of the objective function that lower bounds the actual value of the ground-truth objective on out-of-distribution inputs, and uses it for optimization. Structurally, COMs resemble adversarial training methods used to overcome adversarial examples. COMs are simple to implement and outperform a number of existing methods on a wide range of MBO problems, including optimizing protein sequences, robot morphologies, neural network weights, and superconducting materials.

Brandon Trabucco, Aviral Kumar, Xinyang Geng, Sergey Levine• 2021

Related benchmarks

TaskDatasetResultRank
Offline Black-box OptimizationD'Kitty
Normalized Median Score0.881
25
Offline Black-box OptimizationAnt
Normalized Median Score0.564
25
Offline Black-box OptimizationTF8
Normalized Median Score43.9
25
Offline Black-box OptimizationTF10
Normalized Median Score0.467
25
Offline Black-box OptimizationLLM-DM
Normalized Median Score80.1
25
Offline Black-box OptimizationSuperC
Normalized Median Score31.6
25
Offline Black-box OptimizationOverall Task Suite SuperC, Ant, D’Kitty, LLM-DM, TF8, TF10
Mean Rank17.5
24
Offline Model-Based OptimizationAnt Morphology Design-Bench
100th Percentile Score0.944
23
Offline Model-Based OptimizationD'Kitty Morphology Design-Bench
100th Percentile Score94.9
23
Model-Based OptimizationLat. RBF 11
Expected Top 1% Score66
22
Showing 10 of 32 rows

Other info

Follow for update