Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

PerturbDiff: Functional Diffusion for Single-Cell Perturbation Modeling

About

Building Virtual Cells that can accurately simulate cellular responses to perturbations is a long-standing goal in systems biology. A fundamental challenge is that high-throughput single-cell sequencing is destructive: the same cell cannot be observed both before and after a perturbation. Thus, perturbation prediction requires mapping unpaired control and perturbed populations. Existing models address this by learning maps between distributions, but typically assume a single fixed response distribution when conditioned on observed cellular context (e.g., cell type) and the perturbation type. In reality, responses vary systematically due to unobservable latent factors such as microenvironmental fluctuations and complex batch effects, forming a manifold of possible distributions for the same observed conditions. To account for this variability, we introduce PerturbDiff, which shifts modeling from individual cells to entire distributions. By embedding distributions as points in a Hilbert space, we define a diffusion-based generative process operating directly over probability distributions. This allows PerturbDiff to capture population-level response shifts across hidden factors. Benchmarks on established datasets show that PerturbDiff achieves state-of-the-art performance in single-cell response prediction and generalizes substantially better to unseen perturbations. See our project page (https://katarinayuan.github.io/PerturbDiff-ProjectPage/), where code and data will be made publicly available (https://github.com/DeepGraphLearning/PerturbDiff).

Xinyu Yuan, Xixian Liu, Ya Shi Zhang, Zuobai Zhang, Hongyu Guo, Jian Tang• 2026

Related benchmarks

TaskDatasetResultRank
Perturbation predictionPBMC
DE Overlap56.4
22
Perturbation predictionTahoe100M
DEOver0.58
22
Perturbation predictionReplogle
DEOver0.19
22
Single-cell perturbation response predictionPerturbQA K562
Overall AUROC0.888
4
Single-cell perturbation response predictionTahoe100M C32
Overall AUROC98.7
4
Showing 5 of 5 rows

Other info

Follow for update