EVA-Net: Subject-Independent EEG Motor Decoding with Video-Derived Motor Priors
About
Practical non-invasive Brain-Computer Interface (BCI) systems require EEG decoders with strong cross-subject generalization and minimal calibration. However, inter-subject variability and signal non-stationarity often entangle motor semantics with subject-specific noise, limiting subject-independent decoding. Recent multimodal approaches use text as a semantic anchor, yet text provides sparse and static supervision for inherently dynamic motor processes. To address this issue, we propose EVA-Net, a two-stage framework that uses action videos as semantic priors for subject-independent EEG motor decoding. In the first stage, EEG and video features are aligned in a shared space using cross-modal and supervised contrastive objectives to reduce subject-specific variation. In the second stage, video category prototypes and knowledge distillation transfer video-derived priors to an EEG-only classifier without adding inference overhead. Experiments on two public datasets show that EVA-Net achieves strong subject-independent decoding performance, including an 8.66% LOSO accuracy gain on EEGMMI. Ablation results further suggest that video provides a more effective semantic anchor than the text baseline considered in this work.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Motor Imagery decoding | BCI Competition IV 2a I (cross-subject) | Average Accuracy71.25 | 17 | |
| Motor Imagery decoding | EEGMMI (Leave-One-Subject-Out) | Accuracy72.3 | 6 | |
| Motor Imagery decoding | EEGMMI (pooled-subject K-fold cross-validation) | Accuracy76.1 | 5 | |
| Motor Imagery decoding | BCIC-IV-2a (subject-dependent (session-independent)) | Accuracy75.8 | 5 |