Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Pre-training Molecular Graph Representation with 3D Geometry

About

Molecular graph representation learning is a fundamental problem in modern drug and material discovery. Molecular graphs are typically modeled by their 2D topological structures, but it has been recently discovered that 3D geometric information plays a more vital role in predicting molecular functionalities. However, the lack of 3D information in real-world scenarios has significantly impeded the learning of geometric graph representation. To cope with this challenge, we propose the Graph Multi-View Pre-training (GraphMVP) framework where self-supervised learning (SSL) is performed by leveraging the correspondence and consistency between 2D topological structures and 3D geometric views. GraphMVP effectively learns a 2D molecular graph encoder that is enhanced by richer and more discriminative 3D geometry. We further provide theoretical insights to justify the effectiveness of GraphMVP. Finally, comprehensive experiments show that GraphMVP can consistently outperform existing graph SSL methods.

Shengchao Liu, Hanchen Wang, Weiyang Liu, Joan Lasenby, Hongyu Guo, Jian Tang• 2021

Related benchmarks

TaskDatasetResultRank
Molecular property predictionQM9 (test)
mu0.031
229
Molecular property predictionMoleculeNet BBBP (scaffold)
ROC AUC72.4
140
Molecular property predictionMoleculeNet SIDER (scaffold)
ROC-AUC0.639
120
Molecular property predictionMoleculeNet BACE (scaffold)
ROC-AUC81.2
110
Molecular property predictionMoleculeNet MUV (scaffold)
ROC-AUC0.777
91
molecule property predictionMoleculeNet (scaffold split)
BBBP72.4
85
Molecular property predictionTOXCAST (scaffold)
ROC-AUC63.1
75
Molecular property predictionMoleculeNet HIV (scaffold)
ROC AUC77
66
Molecular Property ClassificationClinTox (scaffold)
ROC-AUC0.791
65
Molecular property predictionBACE (test)
ROC-AUC81.2
65
Showing 10 of 39 rows

Other info

Follow for update