Genetic algorithms are strong baselines for molecule generation
About
Generating molecules, both in a directed and undirected fashion, is a huge part of the drug discovery pipeline. Genetic algorithms (GAs) generate molecules by randomly modifying known molecules. In this paper we show that GAs are very strong algorithms for such tasks, outperforming many complicated machine learning methods: a result which many researchers may find surprising. We therefore propose insisting during peer review that new algorithms must have some clear advantage over GAs, which we call the GA criterion. Ultimately our work suggests that a lot of research in molecule generation should be re-assessed.
Austin Tripp, Jos\'e Miguel Hern\'andez-Lobato• 2023
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Molecular Optimization | Practical Molecular Optimization (PMO) | Sum AUC top-1015.686 | 26 | |
| Goal-directed molecular optimization | PMO | Albuterol Similarity0.928 | 16 | |
| Molecular Generation | GuacaMol (test) | Amlo (AUC-Top10)0.688 | 10 | |
| Scaffold-constrained molecular optimization | Kinase Scaffold Decoration (test) | Objective Score0.438 | 6 |
Showing 4 of 4 rows