Conservative Objective Models for Effective Offline Model-Based Optimization
About
Computational design problems arise in a number of settings, from synthetic biology to computer architectures. In this paper, we aim to solve data-driven model-based optimization (MBO) problems, where the goal is to find a design input that maximizes an unknown objective function provided access to only a static dataset of prior experiments. Such data-driven optimization procedures are the only practical methods in many real-world domains where active data collection is expensive (e.g., when optimizing over proteins) or dangerous (e.g., when optimizing over aircraft designs). Typical methods for MBO that optimize the design against a learned model suffer from distributional shift: it is easy to find a design that "fools" the model into predicting a high value. To overcome this, we propose conservative objective models (COMs), a method that learns a model of the objective function that lower bounds the actual value of the ground-truth objective on out-of-distribution inputs, and uses it for optimization. Structurally, COMs resemble adversarial training methods used to overcome adversarial examples. COMs are simple to implement and outperform a number of existing methods on a wide range of MBO problems, including optimizing protein sequences, robot morphologies, neural network weights, and superconducting materials.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Offline Black-box Optimization | D'Kitty | Normalized Median Score0.881 | 25 | |
| Offline Black-box Optimization | Ant | Normalized Median Score0.564 | 25 | |
| Offline Black-box Optimization | TF8 | Normalized Median Score43.9 | 25 | |
| Offline Black-box Optimization | TF10 | Normalized Median Score0.467 | 25 | |
| Offline Black-box Optimization | LLM-DM | Normalized Median Score80.1 | 25 | |
| Offline Black-box Optimization | SuperC | Normalized Median Score31.6 | 25 | |
| Offline Black-box Optimization | Overall Task Suite SuperC, Ant, D’Kitty, LLM-DM, TF8, TF10 | Mean Rank17.5 | 24 | |
| Offline Model-Based Optimization | Ant Morphology Design-Bench | 100th Percentile Score0.944 | 23 | |
| Offline Model-Based Optimization | D'Kitty Morphology Design-Bench | 100th Percentile Score94.9 | 23 | |
| Model-Based Optimization | Lat. RBF 11 | Expected Top 1% Score66 | 22 |