CMGL: Confidence-guided Multi-omics Graph Learning for Cancer Subtype Classification
About
Motivation: Multi-omics integration can improve cancer subtyping, but modality informativeness and noise vary across cancer types and patients. Existing graph-based methods optimize modality weights jointly with the classification objective and therefore lack independent reliability estimates, so low-quality omics distort patient similarity graphs and amplify noise through message passing. Results: We propose CMGL, a two-stage framework that estimates per-sample modality reliability through evidential deep learning and uses the frozen confidence scores to guide cross-omics fusion and graph construction. On four MLOmics cancer-subtype tasks and the 32-class pan-cancer task, CMGL consistently improves over the strongest baseline, surpassing it by 4.03% in average accuracy on the four single-cancer tasks. Its representations recover the PAM50 intrinsic subtypes of breast invasive carcinoma (BRCA), and the BRCA-trained model transfers without fine-tuning to kidney renal clear cell carcinoma (KIRC), stratifying patients into prognostically distinct groups.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Cancer subtype classification | Pan-cancer 32-class | Accuracy0.9682 | 7 | |
| Cancer subtype classification | BRCA (5-fold cross-validation) | Accuracy85.69 | 5 | |
| Cancer subtype classification | GBM (5-fold val) | Accuracy82.38 | 5 | |
| Cancer subtype classification | LGG 5-fold cross-val | Accuracy95.95 | 5 | |
| Cancer subtype classification | OV 5-fold (val) | Accuracy79.91 | 5 |