RGE-GCN: Recursive Gene Elimination with Graph Convolutional Networks for RNA-seq based Early Cancer Detection
About
Early detection of cancer plays a key role in improving survival rates, but identifying reliable biomarkers from RNA-seq data is still a major challenge. The data are high-dimensional, and conventional statistical methods often fail to capture the complex relationships between genes. In this study, we introduce RGE-GCN (Recursive Gene Elimination with Graph Convolutional Networks), a framework that combines feature selection and classification in a single pipeline. Our approach builds a graph from gene expression profiles, uses a Graph Convolutional Network to classify cancer versus normal samples, and applies Integrated Gradients to highlight the most informative genes. By recursively removing less relevant genes, the model converges to a compact set of biomarkers that are both interpretable and predictive. We evaluated RGE-GCN on synthetic data as well as real-world RNA-seq cohorts of lung, kidney, and cervical cancers. Across all datasets, the method consistently achieved higher accuracy and F1-scores than standard tools such as DESeq2, edgeR, and limma-voom. Importantly, the selected genes aligned with well-known cancer pathways including PI3K-AKT, MAPK, SUMOylation, and immune regulation. These results suggest that RGE-GCN shows promise as a generalizable approach for RNA-seq based early cancer detection and biomarker discovery (https://rce-gcn.streamlit.app/ ).
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Cancer Detection | Cervical cancer | F1 Score92.2 | 16 | |
| Cancer Detection | Lung cancer | F1 Score95 | 16 | |
| Cancer Detection | Kidney cancer | F1 Score0.913 | 16 | |
| RNA-seq Cancer Detection | Cervical cancer RNA-seq dataset | Accuracy90 | 16 | |
| RNA-seq Cancer Detection | Lung cancer RNA-seq dataset | Accuracy94.2 | 16 | |
| RNA-seq Cancer Detection | Kidney cancer RNA-seq dataset | Accuracy94.2 | 16 |