Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Hierarchical Molecular Representation Learning via Fragment-Based Self-Supervised Embedding Prediction

About

Graph self-supervised learning (GSSL) has demonstrated strong potential for generating expressive graph embeddings without the need for human annotations, making it particularly valuable in domains with high labeling costs such as molecular graph analysis. However, existing GSSL methods mostly focus on node- or edge-level information, often ignoring chemically relevant substructures which strongly influence molecular properties. In this work, we propose Graph Semantic Predictive Network (GraSPNet), a hierarchical self-supervised framework that explicitly models both atomic-level and fragment-level semantics. GraSPNet decomposes molecular graphs into chemically meaningful fragments without predefined vocabularies and learns node- and fragment-level representations through multi-level message passing with masked semantic prediction at both levels. This hierarchical semantic supervision enables GraSPNet to learn multi-resolution structural information that is both expressive and transferable. Extensive experiments on multiple molecular property prediction benchmarks demonstrate that GraSPNet learns chemically meaningful representations and consistently outperforms state-of-the-art GSSL methods in transfer learning settings.

Jiele Wu, Haozhe Ma, Zhihan Guo, Thanh Vinh Vo, Tze Yun Leong• 2026

Related benchmarks

TaskDatasetResultRank
Molecular property predictionMoleculeNet BBBP (scaffold)
ROC AUC74.4
117
Molecular property predictionMoleculeNet SIDER (scaffold)
ROC-AUC0.625
97
Molecular property predictionMoleculeNet BACE (scaffold)
ROC-AUC82.9
87
Molecular property predictionMoleculeNet MUV (scaffold)
ROC-AUC0.785
68
Molecular property predictionTOXCAST (scaffold)
ROC-AUC65.5
52
Molecular Property ClassificationClinTox (scaffold)
ROC-AUC0.841
42
Molecular Property Prediction (Regression)Lipophilicity
RMSE0.813
23
Molecular Property Prediction (Regression)ESOL
RMSE1.161
23
RegressionFreeSolv
RMSE1.232
20
Molecular property predictionTox21 scaffold split
ROC-AUC77.3
17
Showing 10 of 11 rows

Other info

Follow for update