Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Genetic-guided GFlowNets for Sample Efficient Molecular Optimization

About

The challenge of discovering new molecules with desired properties is crucial in domains like drug discovery and material design. Recent advances in deep learning-based generative methods have shown promise but face the issue of sample efficiency due to the computational expense of evaluating the reward function. This paper proposes a novel algorithm for sample-efficient molecular optimization by distilling a powerful genetic algorithm into deep generative policy using GFlowNets training, the off-policy method for amortized inference. This approach enables the deep generative policy to learn from domain knowledge, which has been explicitly integrated into the genetic algorithm. Our method achieves state-of-the-art performance in the official molecular optimization benchmark, significantly outperforming previous methods. It also demonstrates effectiveness in designing inhibitors against SARS-CoV-2 with substantially fewer reward calls.

Hyeonah Kim, Minsu Kim, Sanghyeok Choi, Jinkyoo Park• 2024

Related benchmarks

TaskDatasetResultRank
Molecular OptimizationPractical Molecular Optimization (PMO)
Sum AUC top-1016.213
37
Molecular Generationfa7
Top-Hit 5% Docking Score (kcal/mol)-7.288
29
Molecular Generation5ht1b
Docking Score (Top-Hit 5%, kcal/mol)-8.973
29
Molecular Generationjak2
Top-Hit 5% Docking Score (kcal/mol)-8.539
29
Molecular Generationparp1
Top-Hit 5% Docking Score (kcal/mol)-9.227
29
Molecular Generationbraf
Top-Hit 5% Docking Score (kcal/mol)-8.719
28
Goal-directed molecular optimizationPMO
Amlodipine MPO0.761
20
Molecular Dockingjak2
Mean Docking Score-8.539
18
Molecular Dockingfa7
Mean Docking Score-7.288
18
Molecular Docking5ht1b
Mean Docking Score-8.973
18
Showing 10 of 27 rows

Other info

Code

Follow for update