A Cognitive Process-Inspired Architecture for Subject-Agnostic Brain Visual Decoding

About

Subject-agnostic brain decoding, which aims to reconstruct continuous visual experiences from fMRI without subject-specific training, holds great potential for clinical applications. However, this direction remains underexplored due to challenges in cross-subject generalization and the complex nature of brain signals. In this work, we propose Visual Cortex Flow Architecture (VCFlow), a novel hierarchical decoding framework that explicitly models the ventral-dorsal architecture of the human visual system to learn multi-dimensional representations. By disentangling and leveraging features from early visual cortex, ventral, and dorsal streams, VCFlow captures diverse and complementary cognitive information essential for visual reconstruction. Furthermore, we introduce a feature-level contrastive learning strategy to enhance the extraction of subject-invariant semantic representations, thereby enhancing subject-agnostic applicability to previously unseen subjects. Unlike conventional pipelines that need more than 12 hours of per-subject data and heavy computation, VCFlow sacrifices only 7\% accuracy on average yet generates each reconstructed video in 10 seconds without any retraining, offering a fast and clinically scalable solution. The source code will be released upon acceptance of the paper.

Jingyu Lu, Haonan Wang, Qixiang Zhang, Xiaomeng Li• 2025

Related benchmarks

Task	Dataset	Result
fMRI-to-Video Reconstruction	cc Average across Subjects 2017	Semantic Acc (Frame, 50-way)14	8
fMRI-to-Video Reconstruction	cc Subject 1 2017 (test)	Frame Semantic Acc (50-way)14.2	5
fMRI-to-Video Reconstruction	cc Subject 2 2017 (test)	Frame-based Semantic-level 50-way Accuracy13.2	5
fMRI-to-Video Reconstruction	cc Subject 3 2017 (test)	Frame Semantic Acc (50-way)14.7	5
fMRI-to-Video Reconstruction	cc Subject 1 2017	Semantic Acc (Frame, 50-way)14.2	3
fMRI-to-Video Reconstruction	cc Subject 2 2017	Frame Semantic 50-way Acc13.2	3
fMRI-to-Video Reconstruction	cc Subject 3 2017	Frame Semantic 50-way Acc14.7	3

Showing 7 of 7 rows

Other info

Follow for update

@wizwand_team Discord