From Flat to Round: Redefining Brain Decoding with Surface-Based fMRI and Cortex Structure
About
Reconstructing visual stimuli from human brain activity (e.g., fMRI) bridges neuroscience and computer vision by decoding neural representations. However, existing methods often overlook critical brain structure-function relationships, flattening spatial information and neglecting individual anatomical variations. To address these issues, we propose (1) a novel sphere tokenizer that explicitly models fMRI signals as spatially coherent 2D spherical data on the cortical surface; (2) integration of structural MRI (sMRI) data, enabling personalized encoding of individual anatomical variations; and (3) a positive-sample mixup strategy for efficiently leveraging multiple fMRI scans associated with the same visual stimulus. Collectively, these innovations enhance reconstruction accuracy, biological interpretability, and generalizability across individuals. Experiments demonstrate superior reconstruction performance compared to SOTA methods, highlighting the effectiveness and interpretability of our biologically informed approach.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| fMRI Decoding | NSD (Natural Scenes Dataset) shared (test) | Pixel Correlation0.165 | 11 | |
| Brain Captioning | NSD subj01 (test) | BLEU-149.66 | 3 | |
| Brain Captioning | NSD subj02 (test) | BLEU-149.37 | 3 | |
| Brain Captioning | NSD subj05 (test) | BLEU-149.7 | 3 | |
| Brain Captioning | NSD subj07 (test) | BLEU-148.61 | 3 | |
| Brain Captioning | NSD Average (test) | BLEU-149.33 | 3 |