Mind the Gap: Aligning the Brain with Language Models Requires a Nonlinear and Multimodal Approach

Han, Danny Dongyeop; Cho, Yunju; Cha, Jiook; Lee, Jay-Yoon

Computer Science > Computation and Language

arXiv:2502.12771 (cs)

[Submitted on 18 Feb 2025]

Title:Mind the Gap: Aligning the Brain with Language Models Requires a Nonlinear and Multimodal Approach

Authors:Danny Dongyeop Han, Yunju Cho, Jiook Cha, Jay-Yoon Lee

View PDF HTML (experimental)

Abstract:Self-supervised language and audio models effectively predict brain responses to speech. However, traditional prediction models rely on linear mappings from unimodal features, despite the complex integration of auditory signals with linguistic and semantic information across widespread brain networks during speech comprehension. Here, we introduce a nonlinear, multimodal prediction model that combines audio and linguistic features from pre-trained models (e.g., LLAMA, Whisper). Our approach achieves a 17.2% and 17.9% improvement in prediction performance (unnormalized and normalized correlation) over traditional unimodal linear models, as well as a 7.7% and 14.4% improvement, respectively, over prior state-of-the-art models. These improvements represent a major step towards future robust in-silico testing and improved decoding performance. They also reveal how auditory and semantic information are fused in motor, somatosensory, and higher-level semantic regions, aligning with existing neurolinguistic theories. Overall, our work highlights the often neglected potential of nonlinear and multimodal approaches to brain modeling, paving the way for future studies to embrace these strategies in naturalistic neurolinguistics research.

Subjects:	Computation and Language (cs.CL); Neurons and Cognition (q-bio.NC)
Cite as:	arXiv:2502.12771 [cs.CL]
	(or arXiv:2502.12771v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2502.12771

Submission history

From: Danny Dongyeop Han [view email]
[v1] Tue, 18 Feb 2025 11:33:28 UTC (21,282 KB)

Computer Science > Computation and Language

Title:Mind the Gap: Aligning the Brain with Language Models Requires a Nonlinear and Multimodal Approach

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Mind the Gap: Aligning the Brain with Language Models Requires a Nonlinear and Multimodal Approach

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators