Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

A Genetic Algorithm for Navigating Synthesizable Molecular Spaces

About

Inspired by the effectiveness of genetic algorithms and the importance of synthesizability in molecular design, we present SynGA, a simple genetic algorithm that operates directly over synthesis routes. Our method features custom crossover and mutation operators that explicitly constrain it to synthesizable molecular space. By modifying the fitness function, we demonstrate the effectiveness of SynGA on a variety of design tasks, including synthesizable analog search and sample-efficient property optimization, for both 2D and 3D objectives. Furthermore, by coupling SynGA with a machine learning-based filter that focuses the building block set, we boost SynGA to state-of-the-art performance. For property optimization, this manifests as a model-based variant SynGBO, which employs SynGA and block filtering in the inner loop of Bayesian optimization. Since SynGA is lightweight and enforces synthesizability by construction, our hope is that SynGA can not only serve as a strong standalone baseline but also as a versatile module that can be incorporated into larger synthesis-aware workflows in the future.

Alston Lo, Connor W. Coley, Wojciech Matusik• 2025

Related benchmarks

TaskDatasetResultRank
Molecular OptimizationPractical Molecular Optimization (PMO)
Sum AUC top-1016.426
37
Goal-directed molecular optimizationPMO
Amlodipine MPO0.67
20
Docking optimizationLIT-PCBA top-100 diverse modes
ALDH1 Score12.36
8
Synthesizable Analog SearchChEMBL 1k molecules
Validity100
6
Molecular DockingALDH1 LIT-PCBA
Performance (1k Calls)50.5
4
Analog SearchZINC
Validity100
3
Analog SearchChEMBL
Validity100
3
Molecular OptimizationPMO suite 13 tasks
Sum of top-10 AUC9.332
3
Analog SearchDDS-10
Validity100
3
Showing 9 of 9 rows

Other info

Follow for update