Learning Relative Gene Expression Trends from Pathology Images in Spatial Transcriptomics
About
Gene expression estimation from pathology images has the potential to reduce the RNA sequencing cost. Point-wise loss functions have been widely used to minimize the discrepancy between predicted and absolute gene expression values. However, due to the complexity of the sequencing techniques and intrinsic variability across cells, the observed gene expression contains stochastic noise and batch effects, and estimating the absolute expression values accurately remains a significant challenge. To mitigate this, we propose a novel objective of learning relative expression patterns rather than absolute levels. We assume that the relative expression levels of genes exhibit consistent patterns across independent experiments, even when absolute expression values are affected by batch effects and stochastic noise in tissue samples. Based on the assumption, we model the relation and propose a novel loss function called STRank that is robust to noise and batch effects. Experiments using synthetic datasets and real datasets demonstrate the effectiveness of the proposed method. The code is available at https://github.com/naivete5656/STRank.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Gene expression estimation | COAD Visium (multi-cohort) | SCC0.3456 | 28 | |
| Gene expression estimation | HER2ST 50 genes 1 (test) | SCC0.26 | 7 | |
| Gene expression estimation | HER2ST 250 genes 1 (test) | SCC0.194 | 7 | |
| Gene expression estimation | HER2ST 1000 genes 1 (test) | SCC0.176 | 7 | |
| Gene expression estimation | HER2ST 5000 genes 1 (test) | SCC0.177 | 7 | |
| Gene expression estimation | HER2ST 9385 genes 1 (test) | SCC0.173 | 7 | |
| Gene expression estimation | HEST 1k (test) | IDC0.51 | 7 |