Sample Efficient Generative Molecular Optimization with Joint Self-Improvement
About
Generative molecular optimization aims to design molecules with properties surpassing those of existing compounds. However, such candidates are rare and expensive to evaluate, yielding sample efficiency essential. Additionally, surrogate models introduced to predict molecule evaluations, suffer from distribution shift as optimization drives candidates increasingly out-of-distribution. To address these challenges, we introduce Joint Self-Improvement, which benefits from (i) a joint generative-predictive model and (ii) a self-improving sampling scheme. The former aligns the generator with the surrogate, alleviating distribution shift, while the latter biases the generative part of the joint model using the predictive one to efficiently generate optimized molecules at inference-time. Experiments across offline and online molecular optimization benchmarks demonstrate that Joint Self-Improvement outperforms state-of-the-art methods under limited evaluation budgets.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Molecular Optimization | PARP1 online | Hit Ratio69.403 | 8 | |
| Molecular Optimization | FA7 online | Hit Ratio28.667 | 8 | |
| Molecular Optimization | 5HT1B online | Hit Ratio84.91 | 8 | |
| Molecular Optimization | BRAF online | Hit Ratio41.74 | 8 | |
| Molecular Optimization | JAK2 online | Hit Ratio62.317 | 8 | |
| Molecular Optimization | PARP1 (offline) | Hit Ratio45.628 | 5 | |
| Molecular Optimization | FA7 (offline) | Hit Ratio583.1 | 5 | |
| Molecular Optimization | 5HT1B (offline) | Hit Ratio76.94 | 5 | |
| Molecular Optimization | BRAF (offline) | Hit Ratio2.16e+3 | 5 | |
| Molecular Optimization | JAK2 (offline) | Hit Ratio42.001 | 5 |