Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Cell Morphology-Guided Small Molecule Generation with GFlowNets

About

High-content phenotypic screening, including high-content imaging (HCI), has gained popularity in the last few years for its ability to characterize novel therapeutics without prior knowledge of the protein target. When combined with deep learning techniques to predict and represent molecular-phenotype interactions, these advancements hold the potential to significantly accelerate and enhance drug discovery applications. This work focuses on the novel task of HCI-guided molecular design. Generative models for molecule design could be guided by HCI data, for example with a supervised model that links molecules to phenotypes of interest as a reward function. However, limited labeled data, combined with the high-dimensional readouts, can make training these methods challenging and impractical. We consider an alternative approach in which we leverage an unsupervised multimodal joint embedding to define a latent similarity as a reward for GFlowNets. The proposed model learns to generate new molecules that could produce phenotypic effects similar to those of the given image target, without relying on pre-annotated phenotypic labels. We demonstrate that the proposed method generates molecules with high morphological and structural similarity to the target, increasing the likelihood of similar biological activity, as confirmed by an independent oracle model.

Stephen Zhewen Lu, Ziqing Lu, Ehsan Hajiramezanali, Tommaso Biancalani, Yoshua Bengio, Gabriele Scalia, Micha{\l} Koziarski• 2024

Related benchmarks

TaskDatasetResultRank
MoA classificationCell-Phenotype de novo generated molecules CLOOME space
Top-1 Cluster Acc21.2
4
MoA classificationCell-Phenotype de novo generated molecules InfoAlign space
Top-1 Cluster Accuracy17.1
4
MoA classificationCell-Phenotype de novo generated molecules (ECFP space)
Top-1 Cluster Accuracy17
4
MoA classification of de novo generated moleculesJUMP Cell Painting intersected with ChEMBL2K and Broad Drug Repurposing Hub (test)
Top-1 Cluster Accuracy17
4
Cell-phenotype-guided molecular generationCell Phenotype
QED64.9
4
Showing 5 of 5 rows

Other info

Follow for update