Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Genetic-guided GFlowNets for Sample Efficient Molecular Optimization

About

The challenge of discovering new molecules with desired properties is crucial in domains like drug discovery and material design. Recent advances in deep learning-based generative methods have shown promise but face the issue of sample efficiency due to the computational expense of evaluating the reward function. This paper proposes a novel algorithm for sample-efficient molecular optimization by distilling a powerful genetic algorithm into deep generative policy using GFlowNets training, the off-policy method for amortized inference. This approach enables the deep generative policy to learn from domain knowledge, which has been explicitly integrated into the genetic algorithm. Our method achieves state-of-the-art performance in the official molecular optimization benchmark, significantly outperforming previous methods. It also demonstrates effectiveness in designing inhibitors against SARS-CoV-2 with substantially fewer reward calls.

Hyeonah Kim, Minsu Kim, Sanghyeok Choi, Jinkyoo Park• 2024

Related benchmarks

TaskDatasetResultRank
Molecular Generationfa7
Top-Hit 5% Docking Score (kcal/mol)-7.288
27
Molecular Generation5ht1b
Docking Score (Top-Hit 5%, kcal/mol)-8.973
27
Molecular Generationjak2
Top-Hit 5% Docking Score (kcal/mol)-8.539
27
Molecular Generationparp1
Top-Hit 5% Docking Score (kcal/mol)-9.227
27
Molecular OptimizationPractical Molecular Optimization (PMO)
Sum AUC top-1016.213
26
Molecular Generationbraf
Top-Hit 5% Docking Score (kcal/mol)-8.719
26
Molecular Dockingjak2
Mean Docking Score-8.539
18
Molecular Dockingfa7
Mean Docking Score-7.288
18
Molecular Docking5ht1b
Mean Docking Score-8.973
18
Molecular Dockingparp1
Mean Docking Score-9.227
18
Showing 10 of 26 rows

Other info

Code

Follow for update