Language Reconstruction with Brain Predictive Coding from fMRI Data
About
Many recent studies have shown that the perception of speech can be decoded from brain signals and subsequently reconstructed as continuous language. However, there is a lack of neurological basis for how the semantic information embedded within brain signals can be used more effectively to guide language reconstruction. Predictive coding theory suggests the human brain naturally engages in continuously predicting future words that span multiple timescales. This implies that the decoding of brain signals could potentially be associated with a predictable future. To explore the predictive coding theory within the context of language reconstruction, this paper proposes \textsc{PredFT}~(\textbf{F}MRI-to-\textbf{T}ext decoding with \textbf{Pred}ictive coding). \textsc{PredFT} consists of a main network and a side network. The side network obtains brain predictive representation from related regions of interest~(ROIs) with a self-attention module. The representation is then fused into the main network for continuous language decoding. Experiments on two naturalistic language comprehension fMRI datasets show that \textsc{PredFT} outperforms current decoding models on several evaluation metrics.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| cross-subject fMRI-to-text decoding | Narratives | BLEU-127.8 | 9 | |
| fMRI-to-text decoding | LeBel’s dataset Sub-1 | BLEU-134.95 | 5 | |
| fMRI-to-text decoding | LeBel Sub-2 | BLEU-132.46 | 5 | |
| fMRI-to-text decoding | LeBel’s dataset Sub-3 | BLEU-133.22 | 5 |