Molecular De Novo Design through Deep Reinforcement Learning
About
This work introduces a method to tune a sequence-based generative model for molecular de novo design that through augmented episodic likelihood can learn to generate structures with certain specified desirable properties. We demonstrate how this model can execute a range of tasks such as generating analogues to a query structure and generating compounds predicted to be active against a biological target. As a proof of principle, the model is first trained to generate molecules that do not contain sulphur. As a second example, the model is trained to generate analogues to the drug Celecoxib, a technique that could be used for scaffold hopping or library expansion starting from a single molecule. Finally, when tuning the model towards generating compounds predicted to be active against the dopamine receptor type 2, the model generates structures of which more than 95% are predicted to be active, including experimentally confirmed actives that have not been included in either the generative model nor the activity prediction model.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Molecular Generation | fa7 | Top-Hit 5% Docking Score (kcal/mol)-7.205 | 27 | |
| Molecular Generation | 5ht1b | Docking Score (Top-Hit 5%, kcal/mol)-8.77 | 27 | |
| Molecular Generation | parp1 | Top-Hit 5% Docking Score (kcal/mol)-8.702 | 27 | |
| Molecular Generation | jak2 | Top-Hit 5% Docking Score (kcal/mol)-8.165 | 27 | |
| Molecular Optimization | Practical Molecular Optimization (PMO) | Sum AUC top-1015.185 | 26 | |
| Molecular Generation | braf | Top-Hit 5% Docking Score (kcal/mol)-8.392 | 26 | |
| de novo molecular design | GuacaMol goal-directed tasks | Osimertinib MPO Score0.889 | 23 | |
| Molecular Docking | jak2 | Mean Docking Score-8.165 | 18 | |
| Molecular Docking | fa7 | Mean Docking Score-7.205 | 18 | |
| Molecular Docking | 5ht1b | Mean Docking Score-8.77 | 18 |