Scalable Generation of Spatial Transcriptomics from Histology Images via Whole-Slide Flow Matching

About

Spatial transcriptomics (ST) has emerged as a powerful technology for bridging histology imaging with gene expression profiling. However, its application has been limited by low throughput and the need for specialized experimental facilities. Prior works sought to predict ST from whole-slide histology images to accelerate this process, but they suffer from two major limitations. First, they do not explicitly model cell-cell interaction, as they factorize the joint distribution of whole-slide ST data and predict the gene expression of each spot independently. Second, their encoders struggle with memory constraints due to the large number of spots (often exceeding 10,000) in typical ST datasets. Herein, we propose STFlow, a flow matching generative model that considers cell-cell interaction by modeling the joint distribution of gene expression of an entire slide. It also employs an efficient slide-level encoder with local spatial attention, enabling whole-slide processing without excessive memory overhead. On the recently curated HEST-1k and STImage-1K4M benchmarks, STFlow substantially outperforms state-of-the-art baselines and achieves over 18% relative improvement over pathology foundation models.
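
To make the flow matching idea above concrete, here is a minimal NumPy sketch of a whole-slide training target and an Euler sampler. It is an illustration under stated assumptions, not STFlow's implementation: the Gaussian prior, the linear interpolation path, and the names `velocity_fn`, `cond`, `flow_matching_pair`, `flow_matching_loss`, and `euler_sample` are all hypothetical choices made for this example. The point it shows is that the state `x` holds every spot on the slide at once, so a velocity model receiving it can in principle account for interactions between spots rather than predicting each spot independently.

```python
import numpy as np

def flow_matching_pair(x1, rng):
    """Build one whole-slide training example.

    x1: (n_spots, n_genes) observed expression for every spot on the slide.
    Returns the interpolated state x_t, the time t, and the target velocity.
    """
    x0 = rng.standard_normal(x1.shape)  # assumed Gaussian prior sample
    t = rng.uniform()                   # one time value shared by all spots
    x_t = (1.0 - t) * x0 + t * x1       # assumed linear path from prior to data
    target_v = x1 - x0                  # velocity of that linear path
    return x_t, t, target_v

def flow_matching_loss(velocity_fn, x1, cond, rng):
    """Mean squared error between predicted and target velocity.

    velocity_fn(x_t, t, cond) is a hypothetical model that sees all spots
    jointly; cond stands in for per-spot histology features.
    """
    x_t, t, target_v = flow_matching_pair(x1, rng)
    pred_v = velocity_fn(x_t, t, cond)
    return float(np.mean((pred_v - target_v) ** 2))

def euler_sample(velocity_fn, x0, cond, n_steps=10):
    """Integrate dx/dt = velocity_fn(x, t, cond) from t=0 to t=1."""
    x = x0.copy()
    dt = 1.0 / n_steps
    for i in range(n_steps):
        x = x + dt * velocity_fn(x, i * dt, cond)
    return x
```

In practice `velocity_fn` would be a trained network; the sampler then turns a prior draw for the whole slide into a predicted expression matrix in a fixed number of steps.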

Tinglin Huang, Tianyu Liu, Mehrtash Babadi, Wengong Jin, Rex Ying • 2025

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
| --- | --- | --- | --- | --- |
| Spatial gene expression prediction | cSCC (test) | MSE | 0.903 | 15 |
| Spatial Transcriptomics Prediction | HEST-1K Kidney 1.0 (test) | PCC | 0.3145 | 12 |
| gene expression prediction | IDC Top 50 HVGs | Macro-Avg PCC | 0.547 | 8 |
| Spatial gene expression prediction | Her2ST (test) | PCC-50 | 0.543 | 7 |
| Spatial gene expression prediction | Kidney (test) | PCC-50 | 0.391 | 7 |
| Spatial gene expression prediction | HEST-1k HER2ST cohort (test) | PCC | 0.7058 | 6 |
| Spatial gene expression prediction | HEST-1k PRAD cohort (test) | PCC | 0.5337 | 6 |
| Spatial Domain Identification | DLPFC Slide 151673 | ARI | 0.2453 | 6 |
| DEG Consistency Analysis | DLPFC Slide 151673 | Overlap Ratio (Top-20) | 55.71 | 5 |
| gene expression prediction | CCRCC Top 50 HVGs | Macro-Avg PCC | 0.14 | 4 |
Showing 10 of 19 rows
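
Most of the results listed here are Pearson correlation (PCC) or mean squared error (MSE) scores. As a rough guide to how such numbers are typically obtained, the sketch below computes a per-gene PCC across spots, its average over genes, and the MSE. This is one common convention and is assumed here; the exact definitions behind each benchmark (for example which genes enter PCC-50 or a macro average) are not given on this page and may differ. The function names are hypothetical.

```python
import numpy as np

def per_gene_pcc(pred, true):
    """Pearson correlation for each gene, computed across spots.

    pred, true: (n_spots, n_genes) arrays. Returns an (n_genes,) array.
    Genes with zero variance in either array yield NaN.
    """
    pred_c = pred - pred.mean(axis=0)
    true_c = true - true.mean(axis=0)
    num = (pred_c * true_c).sum(axis=0)
    den = np.sqrt((pred_c ** 2).sum(axis=0) * (true_c ** 2).sum(axis=0))
    with np.errstate(divide="ignore", invalid="ignore"):
        return np.where(den > 0, num / den, np.nan)

def mean_pcc(pred, true):
    """Average the per-gene PCC over genes, skipping undefined (NaN) genes."""
    return float(np.nanmean(per_gene_pcc(pred, true)))

def mse(pred, true):
    """Mean squared error over all spots and genes."""
    return float(np.mean((pred - true) ** 2))
```

Averaging per-gene correlations, rather than correlating the flattened matrices, keeps highly expressed genes from dominating the score.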
