Importance-aware Co-teaching for Offline Model-based Optimization

About

Offline model-based optimization aims to find a design that maximizes a property of interest using only an offline dataset, with applications in robot, protein, and molecule design, among others. A prevalent approach is gradient ascent, where a proxy model is trained on the offline dataset and then used to optimize the design. This method suffers from an out-of-distribution issue, where the proxy is not accurate for unseen designs. To mitigate this issue, we explore using a pseudo-labeler to generate valuable data for fine-tuning the proxy. Specifically, we propose \textit{\textbf{I}mportance-aware \textbf{C}o-\textbf{T}eaching for Offline Model-based Optimization}~(\textbf{ICT}). This method maintains three symmetric proxies with their mean ensemble as the final proxy, and comprises two steps. The first step is \textit{pseudo-label-driven co-teaching}. In this step, one proxy is iteratively selected as the pseudo-labeler for designs near the current optimization point, generating pseudo-labeled data. Subsequently, a co-teaching process identifies small-loss samples as valuable data and exchanges them between the other two proxies for fine-tuning, promoting knowledge transfer. This procedure is repeated three times, with a different proxy chosen as the pseudo-labeler each time, ultimately enhancing the ensemble performance. To further improve accuracy of pseudo-labels, we perform a secondary step of \textit{meta-learning-based sample reweighting}, which assigns importance weights to samples in the pseudo-labeled dataset and updates them via meta-learning. ICT achieves state-of-the-art results across multiple design-bench tasks, achieving the best mean rank of $3.1$ and median rank of $2$, among $15$ methods. Our source code can be found here.

Ye Yuan, Can Chen, Zixuan Liu, Willie Neiswanger, Xue Liu• 2023

Related benchmarks

Task	Dataset	Result
Offline Multi-objective Optimization	Off-MOO-Bench	Avg Rank (Overall)6.9	51
Offline Multi-objective Optimization	Off-MOO-Bench MO-NAS	Average IGDoffline Rank8.8	34
Offline Multi-objective Optimization	Off-MOO-Bench Sci-Design	Average IGDoffline Rank9.55	34
Offline Multi-objective Optimization	Off-MOO-Bench MORL	Average IGDoffline Rank8	30
Offline Black-box Optimization	TF10	Normalized Median Score0.541	25
Offline Black-box Optimization	SuperC	Normalized Median Score39.9	25
Offline Black-box Optimization	Ant	Normalized Median Score0.592	25
Offline Black-box Optimization	TF8	Normalized Median Score55.1	25
Offline Black-box Optimization	D'Kitty	Normalized Median Score0.874	25
Offline Black-box Optimization	LLM-DM	Normalized Median Score83	25

Showing 10 of 19 rows

Other info

Follow for update

@wizwand_team Discord