Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Multi-Objective Molecule Generation using Interpretable Substructures

About

Drug discovery aims to find novel compounds with specified chemical property profiles. In terms of generative modeling, the goal is to learn to sample molecules in the intersection of multiple property constraints. This task becomes increasingly challenging when there are many property constraints. We propose to offset this complexity by composing molecules from a vocabulary of substructures that we call molecular rationales. These rationales are identified from molecules as substructures that are likely responsible for each property of interest. We then learn to expand rationales into a full molecule using graph generative models. Our final generative model composes molecules as mixtures of multiple rationale completions, and this mixture is fine-tuned to preserve the properties of interest. We evaluate our model on various drug design tasks and demonstrate significant improvements over state-of-the-art baselines in terms of accuracy, diversity, and novelty of generated compounds.

Wengong Jin, Regina Barzilay, Tommi Jaakkola• 2020

Related benchmarks

TaskDatasetResultRank
Molecular Generationparp1
Top-Hit 5% Docking Score (kcal/mol)-10.663
27
Molecular Generationfa7
Top-Hit 5% Docking Score (kcal/mol)-8.129
27
Molecular Generationjak2
Top-Hit 5% Docking Score (kcal/mol)-9.398
27
Molecular Generation5ht1b
Docking Score (Top-Hit 5%, kcal/mol)-9.005
27
Molecular Dockingparp1
Mean Docking Score-10.663
18
Molecular Dockingfa7
Mean Docking Score-8.129
18
Molecular Dockingjak2
Mean Docking Score-9.398
18
Molecular Docking5ht1b
Mean Docking Score-9.005
18
structure-based drug designprotein targets (Set B)
Uniqueness100
14
structure-based drug designCross-Docked 2020 57 (test)
TOP-100 Score-9.233
14
Showing 10 of 16 rows

Other info

Follow for update