Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

NeuroRVQ: Multi-Scale Biosignal Tokenization for Generative Foundation Models

About

Biosignals such as electroencephalography (EEG), electrocardiography (ECG), and electromyography (EMG) encode physiological activity across multiple temporal and spectral scales, yielding representations that are rich but challenging for machine learning. Foundation models trained to predict masked signal tokens have shown promise in learning generalizable biosignal representations, yet their performance depends on the tokenizer's ability to preserve high-frequency dynamics and reconstruct signals with high fidelity. We introduce NeuroRVQ, a modality-adaptive biosignal tokenizer family designed for high-fidelity signal reconstruction. To capture the full frequency spectrum, NeuroRVQ decomposes biosignals into frequency-specific representations via multi-scale temporal convolutions, each encoded into hierarchical RVQ codebooks to preserve high-frequency detail, combined with a novel phase-aware training loss that respects the circular topology of Fourier phase. By tuning the temporal resolution, number and size of temporal kernels and RVQ depth, this design adapts to the spectro-temporal characteristics of each biosignal modality. To validate that tokenizer quality drives downstream performance, we train a simple masked-token foundation model for each modality (NeuroRVQ-FM) using the corresponding NeuroRVQ tokenizer. The NeuroRVQ-FM family achieves competitive or superior downstream performance compared to existing modality-specific foundation models, demonstrating that high-fidelity tokenization is a critical factor for effective biosignal modeling.

Konstantinos Barmpas, Na Lee, Dimitrios Chalatsis, William Raftery, Yannis Panagakis, Dimitrios A. Adamos, Nikolaos Laskaris, Alexandros Koliousis, Dario Farina, Stefanos Zafeiriou• 2025

Related benchmarks

TaskDatasetResultRank
ClassificationEEG (subject-independent)
BAcc (Motor)70
8
EEG Signal ReconstructionBrainOmni EEG sample data (out-of-distribution)
MSE0.191
4
ECG ClassificationPTB-XL 43-class (subject-independent)
Accuracy79.17
3
EMG ClassificationDiscrete Gestures (subject-independent)
Balanced Accuracy (BAcc)70.8
3
EMG ClassificationEPN-612 (subject-independent)
Accuracy94.65
3
EMG ClassificationNinaPro DB5 (subject-independent)
Accuracy41.36
3
EMG ClassificationUCI-EMG (subject-independent)
Accuracy89.43
3
ECG ClassificationPTB-XL 5-class (subject-independent)
Accuracy70.19
3
Signal ReconstructionHigh Gamma motor dataset 2017 (unseen)
Raw Signal MSE0.084
2
Signal ReconstructionWorking memory dataset 2022 (unseen)
Raw Signal MSE0.09
2
Showing 10 of 10 rows

Other info

Follow for update