Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Conservative Objective Models for Effective Offline Model-Based Optimization

About

Computational design problems arise in a number of settings, from synthetic biology to computer architectures. In this paper, we aim to solve data-driven model-based optimization (MBO) problems, where the goal is to find a design input that maximizes an unknown objective function provided access to only a static dataset of prior experiments. Such data-driven optimization procedures are the only practical methods in many real-world domains where active data collection is expensive (e.g., when optimizing over proteins) or dangerous (e.g., when optimizing over aircraft designs). Typical methods for MBO that optimize the design against a learned model suffer from distributional shift: it is easy to find a design that "fools" the model into predicting a high value. To overcome this, we propose conservative objective models (COMs), a method that learns a model of the objective function that lower bounds the actual value of the ground-truth objective on out-of-distribution inputs, and uses it for optimization. Structurally, COMs resemble adversarial training methods used to overcome adversarial examples. COMs are simple to implement and outperform a number of existing methods on a wide range of MBO problems, including optimizing protein sequences, robot morphologies, neural network weights, and superconducting materials.

Brandon Trabucco, Aviral Kumar, Xinyang Geng, Sergey Levine• 2021

Related benchmarks

TaskDatasetResultRank
Offline Model-Based OptimizationAnt Morphology Design-Bench
100th Percentile Score0.944
23
Offline Model-Based OptimizationD'Kitty Morphology Design-Bench
100th Percentile Score94.9
23
Model-Based OptimizationLat. RBF 11
Expected Top 1% Score66
22
Offline Model-Based OptimizationSuperconductor Design-Bench
Score (P100)51.8
22
Model-Based OptimizationLat. RBF 41
Expected Top 1% Score45
22
Model-Based OptimizationLat. RBF 61
Expected Top 1% Score25
22
Model-Based OptimizationLat. RBF 31
Expected Top 1% Score0.5
22
Model-Based OptimizationTF Bind 8
Expected Top 1% Score1.57
22
Model-Based OptimizationDesign-Bench
LogP30.8
16
Model-Based OptimizationDesign-Bench 2022 (test)
TF-Bind-8 Score0.902
16
Showing 10 of 24 rows

Other info

Follow for update