Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Design-Bench: Benchmarks for Data-Driven Offline Model-Based Optimization

About

Black-box model-based optimization (MBO) problems, where the goal is to find a design input that maximizes an unknown objective function, are ubiquitous in a wide range of domains, such as the design of proteins, DNA sequences, aircraft, and robots. Solving model-based optimization problems typically requires actively querying the unknown objective function on design proposals, which means physically building the candidate molecule, aircraft, or robot, testing it, and storing the result. This process can be expensive and time consuming, and one might instead prefer to optimize for the best design using only the data one already has. This setting -- called offline MBO -- poses substantial and different algorithmic challenges than more commonly studied online techniques. A number of recent works have demonstrated success with offline MBO for high-dimensional optimization problems using high-capacity deep neural networks. However, the lack of standardized benchmarks in this emerging field is making progress difficult to track. To address this, we present Design-Bench, a benchmark for offline MBO with a unified evaluation protocol and reference implementations of recent methods. Our benchmark includes a suite of diverse and realistic tasks derived from real-world optimization problems in biology, materials science, and robotics that present distinct challenges for offline MBO. Our benchmark and reference implementations are released at github.com/rail-berkeley/design-bench and github.com/rail-berkeley/design-baselines.

Brandon Trabucco, Xinyang Geng, Aviral Kumar, Sergey Levine• 2022

Related benchmarks

TaskDatasetResultRank
Discrete OptimizationTF Bind 10
Median Normalized Score0.534
16
Discrete OptimizationTF Bind 8
Median Normalized Score58.1
16
Offline Model-Based OptimizationAnt Morphology (test)
Median Normalized Score0.57
16
Offline Model-Based OptimizationHopper Controller (test)
Median Normalized Score0.388
16
Neural Architecture SearchNAS
Median Normalized Score0.529
16
Offline Model-Based OptimizationSuperconductor (test)
Median Normalized Score0.368
16
Offline Model-Based OptimizationD'Kitty Morphology (test)
Median Normalized Score0.883
16
Offline Model-Based OptimizationAnt Morphology Design-Bench
100th Percentile Score0.882
15
Offline Model-Based OptimizationD'Kitty Morphology Design-Bench
100th Percentile Score90.6
15
Offline Model-Based OptimizationHopper Controller Design-Bench
Score (100th Pctl)0.141
15
Showing 10 of 13 rows

Other info

Follow for update