Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Distribution-Conditioned Transport

About

Learning a transport model that maps a source distribution to a target distribution is a canonical problem in machine learning, but scientific applications increasingly require models that can generalize to source and target distributions unseen during training. We introduce distribution-conditioned transport (DCT), a framework that conditions transport maps on learned embeddings of source and target distributions, enabling generalization to unseen distribution pairs. DCT also allows semi-supervised learning for distributional forecasting problems: because it learns from arbitrary distribution pairs, it can leverage distributions observed at only one condition to improve transport prediction. DCT is agnostic to the underlying transport mechanism, supporting models ranging from flow matching to distributional divergence-based models (e.g. Wasserstein, MMD). We demonstrate the practical performance benefits of DCT on synthetic benchmarks and four applications in biology: batch effect transfer in single-cell genomics, perturbation prediction from mass cytometry data, learning clonal transcriptional dynamics in hematopoiesis, and modeling T-cell receptor sequence evolution.

Nic Fishman, Gokul Gowri, Paolo L. B. Fischer, Marinka Zitnik, Omar Abudayyeh, Jonathan Gootenberg• 2026

Related benchmarks

TaskDatasetResultRank
Perturbation predictionPDO (IID)
MMD Distance (x100)0.001
11
Perturbation predictionPDO (OOD)
MMD Distance0.0022
11
Clonal distribution forecastingWeinreb (628 held-out clones)
MMD Distance2.88
9
Batch effect transferMurine pancreas scRNA-seq (held-out donors)
MMD Distance0.0639
8
TCR repertoire forecastingTCR repertoire sequencing (held-out patients)
MMD Distance0.0078
6
Showing 5 of 5 rows

Other info

Follow for update