Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

MARS: Markov Molecular Sampling for Multi-objective Drug Discovery

About

Searching for novel molecules with desired chemical properties is crucial in drug discovery. Existing work focuses on developing neural models to generate either molecular sequences or chemical graphs. However, it remains a big challenge to find novel and diverse compounds satisfying several properties. In this paper, we propose MARS, a method for multi-objective drug molecule discovery. MARS is based on the idea of generating the chemical candidates by iteratively editing fragments of molecular graphs. To search for high-quality candidates, it employs Markov chain Monte Carlo sampling (MCMC) on molecules with an annealing scheme and an adaptive proposal. To further improve sample efficiency, MARS uses a graph neural network (GNN) to represent and select candidate edits, where the GNN is trained on-the-fly with samples from MCMC. Experiments show that MARS achieves state-of-the-art performance in various multi-objective settings where molecular bio-activity, drug-likeness, and synthesizability are considered. Remarkably, in the most challenging setting where all four objectives are simultaneously optimized, our approach outperforms previous methods significantly in comprehensive evaluations. The code is available at https://github.com/yutxie/mars.

Yutong Xie, Chence Shi, Hao Zhou, Yuwei Yang, Weinan Zhang, Yong Yu, Lei Li• 2021

Related benchmarks

TaskDatasetResultRank
Receptor Docking AffinityTDC DRD3 (leaderboard)
Affinity Score-9.1
48
Property optimizationZINC250k (test)
1st Order Metric0.944
33
Constrained Property OptimizationZINC250K
Improvement4.13
27
Molecular Generationfa7
Top-Hit 5% Docking Score (kcal/mol)-7.839
27
Molecular Generation5ht1b
Docking Score (Top-Hit 5%, kcal/mol)-9.804
27
Molecular Generationjak2
Top-Hit 5% Docking Score (kcal/mol)-9.15
27
Molecular Generationparp1
Top-Hit 5% Docking Score (kcal/mol)-9.716
27
Molecular Generationbraf
Top-Hit 5% Docking Score (kcal/mol)-9.569
26
Molecule DesignMolecule Design (test)
Mode Coverage (R>7.5)0.00e+0
26
de novo molecular designGuacaMol goal-directed tasks
Osimertinib MPO Score0.809
23
Showing 10 of 24 rows

Other info

Follow for update