Scaling Next-Brain-Token Prediction for MEG
About
We present a large autoregressive model for source-space MEG that scales next-token prediction to long context across datasets and scanners: handling a corpus of over 500 hours and thousands of sessions across the three largest MEG datasets. A modified SEANet-style vector-quantizer reduces multichannel MEG into a flattened token stream on which we train a Qwen2.5-VL backbone from scratch to predict the next brain token and to recursively generate minutes of MEG from up to a minute of context. To evaluate long-horizon generation, we introduce task-matched tests: (i) on-manifold stability via generated-only drift compared to the time-resolved distribution of real sliding windows, and (ii) conditional specificity via correct context versus prompt-swap controls using a neurophysiologically grounded metric set. We train on CamCAN and Omega and run all analyses on held-out MOUS, establishing cross-dataset generalization. Across metrics, generations remain relatively stable over long rollouts and are closer to the correct continuation than swapped controls. Code available at: https://github.com/ricsinaruto/brain-gen.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Next-Brain-Token Prediction | MOUS Rest (test) | Covariance Distance0.088 | 2 | |
| MEG Signal Modeling | MEG (held-out) | Context Modeling Duration (s)60 | 2 | |
| Next-Brain-Token Prediction | MOUS Auditory (test) | Covariance Distance0.13 | 2 | |
| Next-Brain-Token Prediction | MOUS Visual (test) | Covariance Distance0.096 | 2 | |
| MEG Signal Modeling | MEG within-subject | -- | 1 |