A Cognitive Process-Inspired Architecture for Subject-Agnostic Brain Visual Decoding
About
Subject-agnostic brain decoding, which aims to reconstruct continuous visual experiences from fMRI without subject-specific training, holds great potential for clinical applications. However, this direction remains underexplored due to challenges in cross-subject generalization and the complex nature of brain signals. In this work, we propose Visual Cortex Flow Architecture (VCFlow), a novel hierarchical decoding framework that explicitly models the ventral-dorsal architecture of the human visual system to learn multi-dimensional representations. By disentangling and leveraging features from the early visual cortex and the ventral and dorsal streams, VCFlow captures diverse and complementary cognitive information essential for visual reconstruction. Furthermore, we introduce a feature-level contrastive learning strategy to enhance the extraction of subject-invariant semantic representations, thereby improving generalization to previously unseen subjects. Unlike conventional pipelines that need more than 12 hours of per-subject data and heavy computation, VCFlow sacrifices only 7% accuracy on average yet generates each reconstructed video in 10 seconds without any retraining, offering a fast and clinically scalable solution. The source code will be released upon acceptance of the paper.
Related benchmarks
| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| fMRI-to-Video Reconstruction | cc Average across Subjects 2017 | Semantic Acc (Frame, 50-way) | 14 | 8 |
| fMRI-to-Video Reconstruction | cc Subject 1 2017 (test) | Frame Semantic Acc (50-way) | 14.2 | 5 |
| fMRI-to-Video Reconstruction | cc Subject 2 2017 (test) | Frame-based Semantic-level 50-way Accuracy | 13.2 | 5 |
| fMRI-to-Video Reconstruction | cc Subject 3 2017 (test) | Frame Semantic Acc (50-way) | 14.7 | 5 |
| fMRI-to-Video Reconstruction | cc Subject 1 2017 | Semantic Acc (Frame, 50-way) | 14.2 | 3 |
| fMRI-to-Video Reconstruction | cc Subject 2 2017 | Frame Semantic 50-way Acc | 13.2 | 3 |
| fMRI-to-Video Reconstruction | cc Subject 3 2017 | Frame Semantic 50-way Acc | 14.7 | 3 |
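
The benchmark rows above report a frame-level, 50-way semantic accuracy. As a rough illustration of how an N-way top-1 metric of this kind is often computed in fMRI reconstruction work, the Python sketch below assumes that a pretrained image classifier has already produced class scores for a reconstructed frame, and that the reference class is the classifier's prediction for the ground-truth frame. The function name, its arguments, and the trial count are hypothetical; the exact evaluation protocol behind these numbers is not described on this page and may differ.

```python
import random


def n_way_top1_accuracy(scores, true_class, n_way=50, n_trials=100, seed=0):
    """Estimate N-way top-1 accuracy for a single reconstructed frame.

    scores:     classifier scores over all classes for the reconstruction
                (assumed input, e.g. softmax outputs).
    true_class: index of the class assigned to the ground-truth frame.
    """
    if not 2 <= n_way <= len(scores):
        raise ValueError("n_way must be between 2 and the number of classes")
    if not 0 <= true_class < len(scores):
        raise ValueError("true_class is out of range")

    rng = random.Random(seed)
    # Every class except the reference one can serve as a distractor.
    distractors = [c for c in range(len(scores)) if c != true_class]

    hits = 0
    for _ in range(n_trials):
        # Draw N-1 random distractors and add the reference class last,
        # so a tie with a distractor is counted as a miss.
        candidates = rng.sample(distractors, n_way - 1) + [true_class]
        best = max(candidates, key=lambda c: scores[c])
        hits += best == true_class
    return hits / n_trials
```

Under this assumed protocol, chance level for a 50-way test is 1/50 = 2%, and a per-subject score is the mean of this quantity over all evaluated frames.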