FLAG: Foundation model representation with Latent diffusion Alignment via Graph for spatial gene expression prediction
About
Predicting spatial gene expression from routine H\&E enables large-scale molecular profiling, yet current models treat this as isolated pointwise tasks, thereby overlooking essential biological structures like gene coordination and spatial distribution. To preserve these relationships, we introduce \textbf{FLAG}, a diffusion-based framework that redefines this task as structured distribution modeling. At the same time, we identify the critical \textbf{Gene Dimension Curse}, where joint modeling gene expression and their spatial interactions fail in high-dimensional spaces, and FLAG solves this challenge by integrating a spatial graph encoder for topological consistency and utilizing Gene Foundation Model (GFM) alignment for gene-gene fidelity in the generation process. To rigorously assess model performance, we propose a set of novel structural evaluation metrics, including Gene Structural Correlation (\textbf{GSC}) and Spatial Structural Correlation (\textbf{SSC}). Our experiments demonstrate that FLAG is highly competitive in traditional accuracy (PCC/MSE) while achieving significantly enhanced structural fidelity in capturing both gene-gene and gene-spatial relationships. The code is available at https://github.com/darkflash03/FLAG.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Spatial Transcriptomics Prediction | HEST-1K Kidney 1.0 (test) | PCC0.3917 | 12 | |
| Spatial gene expression prediction | HEST-1k PRAD cohort (test) | PCC0.5853 | 6 | |
| Spatial Domain Identification | DLPFC Slide 151673 | ARI0.3654 | 6 | |
| Spatial gene expression prediction | HEST-1k HER2ST cohort (test) | PCC0.6835 | 6 | |
| DEG Consistency Analysis | DLPFC Slide 151673 | Overlap Ratio (Top-20)66.57 | 5 | |
| Spatial gene expression prediction | HER2ST HMHVG-200 genes panel | PCC0.6835 | 3 |