Multimodal Prototyping for cancer survival prediction
About
Multimodal survival methods combining gigapixel histology whole-slide images (WSIs) and transcriptomic profiles are particularly promising for patient prognostication and stratification. Current approaches involve tokenizing the WSIs into smaller patches (>10,000 patches) and transcriptomics into gene groups, which are then integrated using a Transformer for predicting outcomes. However, this process generates many tokens, which leads to high memory requirements for computing attention and complicates post-hoc interpretability analyses. Instead, we hypothesize that we can: (1) effectively summarize the morphological content of a WSI by condensing its constituting tokens using morphological prototypes, achieving more than 300x compression; and (2) accurately characterize cellular functions by encoding the transcriptomic profile with biological pathway prototypes, all in an unsupervised fashion. The resulting multimodal tokens are then processed by a fusion network, either with a Transformer or an optimal transport cross-alignment, which now operates with a small and fixed number of tokens without approximations. Extensive evaluation on six cancer types shows that our framework outperforms state-of-the-art methods with much less computation while unlocking new interpretability analyses.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Survival Prediction | TCGA-LUAD | C-index0.639 | 154 | |
| Survival Prediction | TCGA-UCEC | C-index0.69 | 142 | |
| Survival Prediction | TCGA-BRCA | C-index0.671 | 101 | |
| Survival Prediction | TCGA-BLCA | C-index0.689 | 94 | |
| Glioma Grading | TCGA GBM-LGG (3-fold val) | AUC89.12 | 48 | |
| Survival Analysis | TCGA-GBMLGG | C-index0.815 | 44 | |
| Survival Prediction | TCGA GBM-LGG Internal (test) | C-Index77.1 | 37 | |
| Survival Prediction | TCGA (test) | BLCA Score0.657 | 27 | |
| Survival Prediction | CPTAC External (test) | C-Index57.53 | 27 | |
| Diagnosis | TCGA GBM-LGG and IvyGAP (3-fold val) | AUC96.17 | 26 |