Molecular De Novo Design through Deep Reinforcement Learning
About
This work introduces a method to tune a sequence-based generative model for molecular de novo design that through augmented episodic likelihood can learn to generate structures with certain specified desirable properties. We demonstrate how this model can execute a range of tasks such as generating analogues to a query structure and generating compounds predicted to be active against a biological target. As a proof of principle, the model is first trained to generate molecules that do not contain sulphur. As a second example, the model is trained to generate analogues to the drug Celecoxib, a technique that could be used for scaffold hopping or library expansion starting from a single molecule. Finally, when tuning the model towards generating compounds predicted to be active against the dopamine receptor type 2, the model generates structures of which more than 95% are predicted to be active, including experimentally confirmed actives that have not been included in either the generative model nor the activity prediction model.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Molecular Optimization (QED) | TOMG-Bench | Success Rate (SR)55.8 | 39 | |
| Molecular Optimization (MR) | TOMG-Bench | Success Rate (SR)59.5 | 39 | |
| Molecular Optimization (LogP) | TOMG-Bench | Success Rate (SR)46.5 | 39 | |
| Molecular Docking Score Optimization | Target proteins (PARP1, FA7, 5HT1B, BRAF, JAK2) (novel top 5% molecules) | -- | 38 | |
| Molecular Optimization | Practical Molecular Optimization (PMO) | Sum AUC top-1015.185 | 37 | |
| Molecular Generation | fa7 | Top-Hit 5% Docking Score (kcal/mol)-7.205 | 29 | |
| Molecular Generation | 5ht1b | Docking Score (Top-Hit 5%, kcal/mol)-8.77 | 29 | |
| Molecular Generation | parp1 | Top-Hit 5% Docking Score (kcal/mol)-8.702 | 29 | |
| Molecular Generation | jak2 | Top-Hit 5% Docking Score (kcal/mol)-8.165 | 29 | |
| Molecular Generation | braf | Top-Hit 5% Docking Score (kcal/mol)-8.392 | 28 |