NeuroRVQ: Multi-Scale Biosignal Tokenization for Generative Foundation Models
About
Biosignals such as electroencephalography (EEG), electrocardiography (ECG), and electromyography (EMG) encode physiological activity across multiple temporal and spectral scales, yielding representations that are rich but challenging for machine learning. Foundation models trained to predict masked signal tokens have shown promise in learning generalizable biosignal representations, yet their performance depends on the tokenizer's ability to preserve high-frequency dynamics and reconstruct signals with high fidelity. We introduce NeuroRVQ, a modality-adaptive biosignal tokenizer family designed for high-fidelity signal reconstruction. To capture the full frequency spectrum, NeuroRVQ decomposes biosignals into frequency-specific representations via multi-scale temporal convolutions, each encoded into hierarchical RVQ codebooks to preserve high-frequency detail, combined with a novel phase-aware training loss that respects the circular topology of Fourier phase. By tuning the temporal resolution, number and size of temporal kernels and RVQ depth, this design adapts to the spectro-temporal characteristics of each biosignal modality. To validate that tokenizer quality drives downstream performance, we train a simple masked-token foundation model for each modality (NeuroRVQ-FM) using the corresponding NeuroRVQ tokenizer. The NeuroRVQ-FM family achieves competitive or superior downstream performance compared to existing modality-specific foundation models, demonstrating that high-fidelity tokenization is a critical factor for effective biosignal modeling.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Classification | EEG (subject-independent) | BAcc (Motor)70 | 8 | |
| EEG Signal Reconstruction | BrainOmni EEG sample data (out-of-distribution) | MSE0.191 | 4 | |
| ECG Classification | PTB-XL 43-class (subject-independent) | Accuracy79.17 | 3 | |
| EMG Classification | Discrete Gestures (subject-independent) | Balanced Accuracy (BAcc)70.8 | 3 | |
| EMG Classification | EPN-612 (subject-independent) | Accuracy94.65 | 3 | |
| EMG Classification | NinaPro DB5 (subject-independent) | Accuracy41.36 | 3 | |
| EMG Classification | UCI-EMG (subject-independent) | Accuracy89.43 | 3 | |
| ECG Classification | PTB-XL 5-class (subject-independent) | Accuracy70.19 | 3 | |
| Signal Reconstruction | High Gamma motor dataset 2017 (unseen) | Raw Signal MSE0.084 | 2 | |
| Signal Reconstruction | Working memory dataset 2022 (unseen) | Raw Signal MSE0.09 | 2 |