Bidirectional Mamba for Single-Cell Data: Efficient Context Learning with Biological Fidelity
About
Single-cell RNA sequencing (scRNA-seq) enables high-resolution analysis of cellular heterogeneity, but its complexity, which is marked by high dimensionality, sparsity, and batch effects, which poses major computational challenges. Transformer-based models have made significant advances in this domain but are often limited by their quadratic complexity and suboptimal handling of long-range dependencies. In this work, we introduce GeneMamba, a scalable and efficient foundation model for single-cell transcriptomics built on state space modeling. Leveraging the Bi-Mamba architecture, GeneMamba captures bidirectional gene context with linear-time complexity, offering substantial computational gains over transformer baselines. The model is pretrained on nearly 30 million cells and incorporates biologically informed objectives, including pathway-aware contrastive loss and rank-based gene encoding. We evaluate GeneMamba across diverse tasks, including multi-batch integration, cell type annotation, and gene-gene correlation, demonstrating strong performance, interpretability, and robustness. These results position GeneMamba as a practical and powerful alternative to transformer-based methods, advancing the development of biologically grounded, scalable tools for large-scale single-cell data analysis.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Cell-type annotation | Myeloid | Accuracy66.07 | 9 | |
| Cell-type annotation | Myeloid_b | Accuracy96.03 | 9 | |
| Cell-type annotation | hPancreas | Accuracy97.13 | 9 | |
| Cell-type annotation | MS | Accuracy68.25 | 9 | |
| Cell-type annotation | hPancreas (test) | Accuracy97.13 | 6 | |
| Cell-type annotation | Myeloid (test) | Accuracy66.07 | 6 | |
| Cell-type annotation | Myeloid_b (test) | Accuracy0.9603 | 6 | |
| Gene rank reconstruction | PBMC12k | L-Dist6 | 6 | |
| Cell-type annotation | MS (test) | Accuracy0.6825 | 6 | |
| Multi-batch integration | Immune | Average Batch Score95.36 | 5 |