Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MindTuner: Cross-Subject Visual Decoding with Visual Fingerprint and Semantic Correction

About

Decoding natural visual scenes from brain activity has flourished, with extensive research in single-subject tasks and, however, less in cross-subject tasks. Reconstructing high-quality images in cross-subject tasks is a challenging problem due to profound individual differences between subjects and the scarcity of data annotation. In this work, we proposed MindTuner for cross-subject visual decoding, which achieves high-quality and rich semantic reconstructions using only 1 hour of fMRI training data benefiting from the phenomena of visual fingerprint in the human visual system and a novel fMRI-to-text alignment paradigm. Firstly, we pre-train a multi-subject model among 7 subjects and fine-tune it with scarce data on new subjects, where LoRAs with Skip-LoRAs are utilized to learn the visual fingerprint. Then, we take the image modality as the intermediate pivot modality to achieve fMRI-to-text alignment, which achieves impressive fMRI-to-text retrieval performance and corrects fMRI-to-image reconstruction with fine-tuned semantics. The results of both qualitative and quantitative analyses demonstrate that MindTuner surpasses state-of-the-art cross-subject visual decoding models on the Natural Scenes Dataset (NSD), whether using training data of 1 hour or 40 hours.

Zixuan Gong, Qi Zhang, Guangyin Bao, Lei Zhu, Ke Liu, Liang Hu, Duoqian Miao• 2024

Related benchmarks

TaskDatasetResultRank
fMRI-to-image reconstructionNSD (Subjects 01, 02, 05, 07)
Inception Feature Similarity95.6
32
fMRI-to-image visual decodingNatural Scenes Dataset (NSD) Subject 1
Pixel Correlation (PixCorr)0.262
4
fMRI-to-image visual decodingNatural Scenes Dataset (NSD) Subject 2
Pixel Correlation0.225
4
fMRI-to-image visual decodingNatural Scenes Dataset (NSD) Subject 5
Pixel Correlation0.208
4
fMRI-to-image visual decodingNatural Scenes Dataset (NSD) Subject 7
PixCorr0.202
4
fMRI Image ReconstructionSubject 1
SSIM (color)37.1
4
Showing 6 of 6 rows

Other info

Follow for update