Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Bidirectional Generation of Structure and Properties Through a Single Molecular Foundation Model

About

The recent success of large foundation models in artificial intelligence has prompted the emergence of chemical pre-trained models. Despite the growing interest in large molecular pre-trained models that provide informative representations for downstream tasks, attempts for multimodal pre-training approaches on the molecule domain were limited. To address this, we present a novel multimodal molecular pre-trained model that incorporates the modalities of structure and biochemical properties, drawing inspiration from recent advances in multimodal learning techniques. Our proposed model pipeline of data handling and training objectives aligns the structure/property features in a common embedding space, which enables the model to regard bidirectional information between the molecules' structure and properties. These contributions emerge synergistic knowledge, allowing us to tackle both multimodal and unimodal downstream tasks through a single model. Through extensive experiments, we demonstrate that our model shows remarkable capabilities in solving various meaningful chemical challenges, including conditional molecule generation, property prediction, molecule classification, and reaction prediction.

Jinho Chang, Jong Chul Ye• 2022

Related benchmarks

TaskDatasetResultRank
Molecular Property ClassificationMoleculeNet BBBP
ROC AUC74.9
41
Molecular Property ClassificationMoleculeNet BACE
ROC AUC84.8
36
Molecular Property ClassificationMoleculeNet ClinTox
ROC-AUC91.8
27
Molecular Property ClassificationMoleculeNet SIDER
ROC-AUC0.655
21
RegressionMoleculeNet LIPO
RMSE0.692
19
RegressionMoleculeNet BACE (test)
RMSE1.096
14
RegressionMoleculeNet Delaney ESOL
RMSE0.818
10
RegressionMoleculeNet Freesolv
RMSE1.907
9
Forward reaction predictionReaction Prediction Dataset
Top-1 Accuracy91.5
7
RegressionMoleculeNet Clearance
RMSE42.841
5
Showing 10 of 12 rows

Other info

Code

Follow for update