Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Molecule Generation by Principal Subgraph Mining and Assembling

About

Molecule generation is central to a variety of applications. Current attention has been paid to approaching the generation task as subgraph prediction and assembling. Nevertheless, these methods usually rely on hand-crafted or external subgraph construction, and the subgraph assembling depends solely on local arrangement. In this paper, we define a novel notion, principal subgraph, that is closely related to the informative pattern within molecules. Interestingly, our proposed merge-and-update subgraph extraction method can automatically discover frequent principal subgraphs from the dataset, while previous methods are incapable of. Moreover, we develop a two-step subgraph assembling strategy, which first predicts a set of subgraphs in a sequence-wise manner and then assembles all generated subgraphs globally as the final output molecule. Built upon graph variational auto-encoder, our model is demonstrated to be effective in terms of several evaluation metrics and efficiency, compared with state-of-the-art methods on distribution learning and (constrained) property optimization tasks.

Xiangzhe Kong, Wenbing Huang, Zhixing Tan, Yang Liu• 2021

Related benchmarks

TaskDatasetResultRank
Protein-ligand binding affinity predictionPDBbind Sequence Identity (30%) 2017
RMSE1.387
71
Property optimizationZINC250k (test)
1st Order Metric0.948
33
Molecular Generationfa7
Top-Hit 5% Docking Score (kcal/mol)-8.028
29
Molecular Generation5ht1b
Docking Score (Top-Hit 5%, kcal/mol)-9.887
29
Molecular Generationjak2
Top-Hit 5% Docking Score (kcal/mol)-9.464
29
Molecular Generationparp1
Top-Hit 5% Docking Score (kcal/mol)-9.978
29
Molecular Generationbraf
Top-Hit 5% Docking Score (kcal/mol)-9.637
28
Constrained Property OptimizationZINC250K
Improvement6.42
27
Molecular Dockingfa7
Mean Docking Score-8.028
18
Molecular Docking5ht1b
Mean Docking Score-9.887
18
Showing 10 of 23 rows

Other info

Code

Follow for update