Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

PhAME: Phenotype-Aware Molecular Editing via Latent Diffusion

About

Small-molecule drug discovery requires simultaneous optimization of numerous properties of candidate molecules. These properties can be investigated through the analysis of high-dimensional biological signatures, such as cell morphology and transcriptomic perturbations, which provide a rich perspective on the underlying biological mechanisms. However, existing generative methods, which use those signatures for optimization, fail to meet two key requirements: providing precise guidance toward desired phenotypic signatures while maintaining structural proximity to a known hit. We introduce PhAME (Phenotype-Aware Molecular Editing), a latent diffusion framework that overcomes this challenge by recasting molecular optimization as editing in the latent space of a pretrained graph-based VAE. Our central contribution is a compositional classifier-free guidance scheme with two independent scales, one for the phenotype-conditioning and one for similarity to the seed structure, allowing practitioners to control the tradeoff between these two objectives. Empirical evaluations across diverse benchmarks, including docking score optimization and multimodal phenotypic generation, demonstrate that PhAME achieves state-of-the-art results while maintaining high chemical validity and novelty.

{\L}ukasz Janisi\'ow, Sebastian Musia{\l}, Bartosz Zieli\'nski, Dawid Rymarczyk, Tomasz Danel• 2026

Related benchmarks

TaskDatasetResultRank
LogP optimization with similarity preservationMOSES
Nov0.99
8
LogP optimization with similarity preservationMolecular logP optimization dataset
Validity0.71
7
Gene-conditioned Molecule GenerationPARP1, FA7, 5HT1B, BRAF, and JAK2 targets ZINC250k-trained (average across targets)
QED71
4
MoA classificationCell-Phenotype de novo generated molecules InfoAlign space
Top-1 Cluster Accuracy28.7
4
MoA classificationCell-Phenotype de novo generated molecules (ECFP space)
Top-1 Cluster Accuracy27.2
4
MoA classification of de novo generated moleculesJUMP Cell Painting intersected with ChEMBL2K and Broad Drug Repurposing Hub (test)
Top-1 Cluster Accuracy27.2
4
Cell-phenotype-guided molecular generationCell Phenotype
QED67.4
4
MoA classificationCell-Phenotype de novo generated molecules CLOOME space
Top-1 Cluster Acc17.8
4
Showing 8 of 8 rows

Other info

Follow for update