Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Retrieval-based Controllable Molecule Generation

About

Generating new molecules with specified chemical and biological properties via generative models has emerged as a promising direction for drug discovery. However, existing methods require extensive training/fine-tuning with a large dataset, often unavailable in real-world generation tasks. In this work, we propose a new retrieval-based framework for controllable molecule generation. We use a small set of exemplar molecules, i.e., those that (partially) satisfy the design criteria, to steer the pre-trained generative model towards synthesizing molecules that satisfy the given design criteria. We design a retrieval mechanism that retrieves and fuses the exemplar molecules with the input molecule, which is trained by a new self-supervised objective that predicts the nearest neighbor of the input molecule. We also propose an iterative refinement process to dynamically update the generated molecules and retrieval database for better generalization. Our approach is agnostic to the choice of generative models and requires no task-specific fine-tuning. On various tasks ranging from simple design criteria to a challenging real-world scenario for designing lead compounds that bind to the SARS-CoV-2 main protease, we demonstrate our approach extrapolates well beyond the retrieval database, and achieves better performance and wider applicability than previous methods. Code is available at https://github.com/NVlabs/RetMol.

Zichao Wang, Weili Nie, Zhuoran Qiao, Chaowei Xiao, Richard Baraniuk, Anima Anandkumar• 2022

Related benchmarks

TaskDatasetResultRank
Molecular Generationparp1
Top-Hit 5% Docking Score (kcal/mol)-8.59
27
Molecular Generationfa7
Top-Hit 5% Docking Score (kcal/mol)-5.448
27
Molecular Generation5ht1b
Docking Score (Top-Hit 5%, kcal/mol)-6.98
27
Molecular Generationjak2
Top-Hit 5% Docking Score (kcal/mol)-7.133
27
Molecular Generationbraf
Top-Hit 5% Docking Score (kcal/mol)-8.811
26
Molecular Dockingjak2
Mean Docking Score-7.133
18
Molecular Dockingparp1
Mean Docking Score-8.59
18
Molecular Dockingfa7
Mean Docking Score-5.448
18
Molecular Docking5ht1b
Mean Docking Score-6.98
18
Molecular Dockingbraf
Mean Docking Score-8.811
17
Showing 10 of 10 rows

Other info

Follow for update