CodeBrain: Bridging Decoupled Tokenizer and Multi-Scale Architecture for EEG Foundation Model
About
Electroencephalography (EEG) provides real-time insights into brain activity and supports diverse applications in neuroscience. While EEG foundation models (EFMs) have emerged to address the scalability issues of task-specific models, current approaches still yield clinically uninterpretable and weakly discriminative representations, inefficiently capturing global dependencies and neglecting important local neural events. We present CodeBrain, a two-stage EFM designed to fill this gap. In the first stage, we introduce the TFDual-Tokenizer, which decouples heterogeneous temporal and frequency EEG signals into discrete tokens, quadratically expanding the representation space to enhance discriminative power and offering domain-specific representation-level interpretability by suggesting potential links to neural events and spectral rhythms. In the second stage, we propose the multi-scale EEGSSM architecture, which combines structured global convolution with sliding window attention to efficiently capture both sparse long-range and local dependencies, reflecting the brain's small-world topology. Pretrained on the largest public EEG corpus, CodeBrain achieves strong generalization across eight downstream tasks and ten datasets under distribution shifts, supported by comprehensive ablations, scaling-law analyzes, and interpretability evaluations. The code and the pretrained weights are available at https://github.com/jingyingma01/CodeBrain.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Binary classification of normal versus abnormal EEG signals | TUAB | Balanced Accuracy82.94 | 113 | |
| EEG Classification | SEED | AUPRC64.3 | 32 | |
| EEG Classification | ISRUC | Kappa58.46 | 32 | |
| EEG Classification | Mental Arithmetic | AUPRC47.1 | 32 | |
| Six-class classification of EEG events | TUEV | Balanced Accuracy64.28 | 27 | |
| EEG Classification | TUEV (test) | Balanced Accuracy64.28 | 24 | |
| Binary Classification | CHB_MIT 2-Class (test) | AUROC0.8961 | 20 | |
| Binary Classification | Mental Arithmetic 2-Class (test) | AUROC87.07 | 10 | |
| EEG Classification | ISRUC_S1 5-Class | Cohen's Kappa0.7476 | 10 | |
| Motor Imagery Classification | SHU-MI 2-Class | Balanced Accuracy64.31 | 10 |