Conservative Objective Models for Effective Offline Model-Based Optimization

About

Computational design problems arise in a number of settings, from synthetic biology to computer architectures. In this paper, we aim to solve data-driven model-based optimization (MBO) problems, where the goal is to find a design input that maximizes an unknown objective function provided access to only a static dataset of prior experiments. Such data-driven optimization procedures are the only practical methods in many real-world domains where active data collection is expensive (e.g., when optimizing over proteins) or dangerous (e.g., when optimizing over aircraft designs). Typical methods for MBO that optimize the design against a learned model suffer from distributional shift: it is easy to find a design that "fools" the model into predicting a high value. To overcome this, we propose conservative objective models (COMs), a method that learns a model of the objective function that lower bounds the actual value of the ground-truth objective on out-of-distribution inputs, and uses it for optimization. Structurally, COMs resemble adversarial training methods used to overcome adversarial examples. COMs are simple to implement and outperform a number of existing methods on a wide range of MBO problems, including optimizing protein sequences, robot morphologies, neural network weights, and superconducting materials.

Brandon Trabucco, Aviral Kumar, Xinyang Geng, Sergey Levine• 2021

Related benchmarks

Task	Dataset	Result
Offline Multi-objective Optimization	Off-MOO-Bench	Avg Rank (Overall)8.58	51
Offline Multi-objective Optimization	Off-MOO-Bench MO-NAS	Average IGDoffline Rank6.56	34
Offline Multi-objective Optimization	Off-MOO-Bench Sci-Design	Average IGDoffline Rank6.72	34
Offline Multi-objective Optimization	Off-MOO-Bench MORL	Average IGDoffline Rank6	30
Offline Black-box Optimization	D'Kitty	Normalized Median Score0.881	25
Offline Black-box Optimization	Ant	Normalized Median Score0.564	25
Offline Black-box Optimization	TF8	Normalized Median Score43.9	25
Offline Black-box Optimization	TF10	Normalized Median Score0.467	25
Offline Black-box Optimization	LLM-DM	Normalized Median Score80.1	25
Offline Black-box Optimization	SuperC	Normalized Median Score31.6	25

Showing 10 of 47 rows

Other info

Follow for update

@wizwand_team Discord