
Molecule Generation with Fragment Retrieval Augmentation

About

Fragment-based drug discovery, in which molecular fragments are assembled into new molecules with desirable biochemical properties, has achieved great success. However, many fragment-based molecule generation methods show limited exploration beyond the existing fragments in the database as they only reassemble or slightly modify the given ones. To tackle this problem, we propose a new fragment-based molecule generation framework with retrieval augmentation, namely Fragment Retrieval-Augmented Generation (f-RAG). f-RAG is based on a pre-trained molecular generative model that proposes additional fragments from input fragments to complete and generate a new molecule. Given a fragment vocabulary, f-RAG retrieves two types of fragments: (1) hard fragments, which serve as building blocks that will be explicitly included in the newly generated molecule, and (2) soft fragments, which serve as reference to guide the generation of new fragments through a trainable fragment injection module. To extrapolate beyond the existing fragments, f-RAG updates the fragment vocabulary with generated fragments via an iterative refinement process which is further enhanced with post-hoc genetic fragment modification. f-RAG can achieve an improved exploration-exploitation trade-off by maintaining a pool of fragments and expanding it with novel and high-quality fragments through a strong generative prior.

Seul Lee, Karsten Kreis, Srimukh Prasad Veccham, Meng Liu, Danny Reidenbach, Saee Paliwal, Arash Vahdat, Weili Nie • 2024
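
The retrieve, generate, and update cycle described in the abstract can be illustrated with a toy loop. This is only a minimal sketch under stated assumptions, not the authors' implementation: fragments are plain strings, `toy_generate` and `toy_score` are hypothetical stand-ins for the pre-trained generative model and the property oracle, retrieval simply takes the top-scoring fragments as hard fragments and the next-best as soft fragments, and joining with `.` is a placeholder for real fragment assembly. The post-hoc genetic fragment modification step is omitted.

```python
import random
from typing import Callable, Dict, List, Tuple

Vocabulary = Dict[str, float]  # fragment string -> score (higher is better)


def retrieve(vocab: Vocabulary, n_hard: int, n_soft: int) -> Tuple[List[str], List[str]]:
    """Assumed retrieval rule: best fragments are hard, next-best are soft."""
    ranked = sorted(vocab, key=vocab.get, reverse=True)
    return ranked[:n_hard], ranked[n_hard:n_hard + n_soft]


def toy_generate(hard: List[str], soft: List[str], rng: random.Random) -> Tuple[str, str]:
    """Hypothetical stand-in for the generative model.

    Hard fragments are copied into the output verbatim; a soft fragment only
    guides the proposal of a new fragment (here: it is extended by one atom).
    """
    new_fragment = rng.choice(soft) + rng.choice("CNO")
    molecule = ".".join(hard + [new_fragment])  # placeholder for assembly
    return molecule, new_fragment


def toy_score(fragment: str) -> float:
    """Hypothetical property oracle: rewards nitrogen atoms and length."""
    return fragment.count("N") + 0.1 * len(fragment)


def refine(
    vocab: Vocabulary,
    generate: Callable[[List[str], List[str], random.Random], Tuple[str, str]],
    score: Callable[[str], float],
    rounds: int,
    n_hard: int = 2,
    n_soft: int = 2,
    max_size: int = 8,
    seed: int = 0,
) -> Tuple[Vocabulary, List[str]]:
    """Iteratively expand the vocabulary with generated fragments."""
    rng = random.Random(seed)
    vocab = dict(vocab)  # do not mutate the caller's vocabulary
    molecules: List[str] = []
    for _ in range(rounds):
        hard, soft = retrieve(vocab, n_hard, n_soft)
        molecule, new_fragment = generate(hard, soft, rng)
        molecules.append(molecule)
        vocab[new_fragment] = score(new_fragment)
        # Keep only the best max_size fragments so the pool stays bounded.
        keep = sorted(vocab, key=vocab.get, reverse=True)[:max_size]
        vocab = {f: vocab[f] for f in keep}
    return vocab, molecules


initial = {f: toy_score(f) for f in ["CCO", "CCN", "c1ccccc1", "NC=O"]}
final_vocab, molecules = refine(initial, toy_generate, toy_score, rounds=5)
print(molecules[-1])
print(sorted(final_vocab, key=final_vocab.get, reverse=True))
```

Because the pool is truncated by score after every round, the best score in the vocabulary never decreases, while newly generated fragments can displace weaker ones. That is the exploration-exploitation balance the abstract describes, reduced to its simplest form.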

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
| --- | --- | --- | --- | --- |
| Molecular Generation | parp1 | Top-Hit 5% Docking Score (kcal/mol) | -12.945 | 27 |
| Molecular Generation | fa7 | Top-Hit 5% Docking Score (kcal/mol) | -9.899 | 27 |
| Molecular Generation | 5ht1b | Docking Score (Top-Hit 5%, kcal/mol) | -12.67 | 27 |
| Molecular Generation | jak2 | Top-Hit 5% Docking Score (kcal/mol) | -11.842 | 27 |
| Molecular Generation | braf | Top-Hit 5% Docking Score (kcal/mol) | -12.39 | 26 |
| Molecular Docking | parp1 | Mean Docking Score | -12.945 | 18 |
| Molecular Docking | 5ht1b | Mean Docking Score | -12.67 | 18 |
| Molecular Docking | fa7 | Mean Docking Score | -9.899 | 18 |
| Molecular Docking | jak2 | Mean Docking Score | -11.842 | 18 |
| Molecular Docking | braf | Mean Docking Score | -12.39 | 17 |
Showing 10 of 16 rows
