CSI-4CAST: A Hybrid Deep Learning Model for CSI Prediction with Comprehensive Robustness and Generalization Testing
About
Channel state information (CSI) prediction is a promising strategy for ensuring reliable and efficient operation of massive multiple-input multiple-output (mMIMO) systems by providing timely downlink (DL) CSI. While deep learning-based methods have advanced beyond conventional model-driven and statistical approaches, they remain limited in robustness to practical non-Gaussian noise, generalization across diverse channel conditions, and computational efficiency. This paper introduces CSI-4CAST, a hybrid deep learning architecture that integrates 4 key components, i.e., Convolutional neural network residuals, Adaptive correction layers, ShuffleNet blocks, and Transformers, to efficiently capture both local and long-range dependencies in CSI prediction. To enable rigorous evaluation, this work further presents a comprehensive benchmark, CSI-RRG for Regular, Robustness and Generalization testing, which includes more than 300,000 samples across 3,060 realistic scenarios for both TDD and FDD systems. The dataset spans multiple channel models, a wide range of delay spreads and user velocities, and diverse noise types and intensity degrees. Experimental results show that CSI-4CAST achieves superior prediction accuracy with substantially lower computational cost, outperforming baselines in 81.5% of TDD scenarios and 44.4% of FDD scenario, the best performance among all evaluated models, while reducing FLOPs by 5x and 3x compared to LLM4CP, the strongest baseline. In addition, evaluation over CSI-RRG provides valuable insights into how different channel factors affect the performance and generalization capability of deep learning models. Both the dataset (https://huggingface.co/CSI-4CAST) and evaluation protocols (https://github.com/AI4OPT/CSI-4CAST) are publicly released to establish a standardized benchmark and to encourage further research on robust and efficient CSI prediction.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| CSI Prediction | 3GPP CDL TDD (Regular split) | NMSE0.084 | 27 | |
| CSI Prediction | 3GPP CDL TDD (Generalization) | NMSE0.119 | 27 | |
| CSI Prediction | 3GPP CDL FDD (Regular) | NMSE0.19 | 21 | |
| CSI Prediction | 3GPP CDL FDD Generalization | NMSE0.495 | 21 | |
| CSI Prediction | 3GPP CDL TDD Regular Track (train) | CDL-A0.156 | 9 | |
| CSI Forecasting | TDD | FLOPs (G)71.9 | 9 | |
| CSI Prediction | 3GPP CDL TDD Track (generalization) | CDL-B Score0.376 | 9 | |
| CSI Forecasting | FDD | FLOPs (G)101.6 | 7 | |
| CSI Prediction | 3GPP CDL FDD Regular Track (train) | CDL-A0.385 | 7 | |
| CSI Prediction | 3GPP CDL FDD Generalization Track | CDL-B1.308 | 7 |