Equivariant 3D-Conditional Diffusion Models for Molecular Linker Design
About
Fragment-based drug discovery has been an effective paradigm in early-stage drug development. An open challenge in this area is designing linkers between disconnected molecular fragments of interest to obtain chemically-relevant candidate drug molecules. In this work, we propose DiffLinker, an E(3)-equivariant 3D-conditional diffusion model for molecular linker design. Given a set of disconnected fragments, our model places missing atoms in between and designs a molecule incorporating all the initial fragments. Unlike previous approaches that are only able to connect pairs of molecular fragments, our method can link an arbitrary number of fragments. Additionally, the model automatically determines the number of atoms in the linker and its attachment points to the input fragments. We demonstrate that DiffLinker outperforms other methods on the standard datasets generating more diverse and synthetically-accessible molecules. Besides, we experimentally test our method in real-world applications, showing that it can successfully generate valid linkers conditioned on target protein pockets.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| 3D Molecule Generation | QM9 | P (0 rings)99.7 | 13 | |
| 3D Molecule Generation | GEOM-DRUG (above 11 rings) | P Score6.29 | 10 | |
| Molecule Generation | QM9 In Distribution | P91.8 | 8 | |
| Molecule Generation | QM9 (OOD I) | P Score90.3 | 8 | |
| Molecule Generation | QM9 (OOD II) | P80.4 | 8 | |
| Linker Design | QM9 (OOD I) | AS Score85.6 | 7 | |
| Linker Design | QM9 (OOD II) | AS76.5 | 7 | |
| Linker Design | QM9 In-dist | AS Score89.6 | 7 | |
| Dual-target drug design | 12,917 pairs of targets dual-target setting (test) | P-1 Vina Dock (Avg)-7.05 | 7 | |
| Linker Design | GEOM-LINKER (OOD) | QED56 | 3 |