Speech language models lack important brain-relevant semantics

Oota, Subba Reddy; Çelik, Emin; Deniz, Fatma; Toneva, Mariya

arXiv:2311.04664v1 (cs)

[Submitted on 8 Nov 2023 (this version), latest version 16 Jun 2024 (v2)]

Title: Speech language models lack important brain-relevant semantics

Authors: Subba Reddy Oota, Emin Çelik, Fatma Deniz, Mariya Toneva

Abstract: Despite known differences between reading and listening in the brain, recent work has shown that text-based language models predict both text-evoked and speech-evoked brain activity to an impressive degree. This poses the question of what types of information language models truly predict in the brain. We investigate this question via a direct approach, in which we eliminate information related to specific low-level stimulus features (textual, speech, and visual) in the language model representations, and observe how this intervention affects the alignment with fMRI brain recordings acquired while participants read versus listened to the same naturalistic stories. We further contrast our findings with speech-based language models, which would be expected to predict speech-evoked brain activity better, provided they model language processing in the brain well. Using our direct approach, we find that both text-based and speech-based language models align well with early sensory regions due to shared low-level features. Text-based models continue to align well with later language regions even after removing these features, while, surprisingly, speech-based models lose most of their alignment. These findings suggest that speech-based models can be further improved to better reflect brain-like language processing.
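
The "direct approach" described in the abstract (remove low-level stimulus information from a model representation, then re-measure brain alignment) can be illustrated with a toy sketch. Everything below is an assumption made for illustration and is not the authors' pipeline: the data are synthetic, a single scalar stands in for the low-level feature, ordinary least squares stands in for whatever regression the paper uses, and a plain Pearson correlation stands in for an fMRI encoding model. The names `residualize`, `early_voxel`, and `late_voxel` are hypothetical.

```python
import random


def mean(xs):
    return sum(xs) / len(xs)


def pearson(a, b):
    """Pearson correlation between two equal-length sequences."""
    ma, mb = mean(a), mean(b)
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    var_a = sum((x - ma) ** 2 for x in a)
    var_b = sum((y - mb) ** 2 for y in b)
    return cov / (var_a * var_b) ** 0.5


def residualize(rep, feature):
    """Remove the part of `rep` that is linearly predictable from `feature`.

    Hypothetical helper: fits OLS with an intercept and returns the residuals.
    """
    mr, mf = mean(rep), mean(feature)
    slope = (
        sum((f - mf) * (r - mr) for f, r in zip(feature, rep))
        / sum((f - mf) ** 2 for f in feature)
    )
    return [r - mr - slope * (f - mf) for r, f in zip(rep, feature)]


# Synthetic data (assumption): one low-level signal and one semantic signal.
rng = random.Random(0)
n = 500
low_level = [rng.gauss(0, 1) for _ in range(n)]
semantic = [rng.gauss(0, 1) for _ in range(n)]

# A toy "model representation" that mixes both kinds of information.
model_rep = [l + s for l, s in zip(low_level, semantic)]

# Toy "brain" responses: an early sensory voxel driven by the low-level signal
# and a later language voxel driven by the semantic signal, each with noise.
early_voxel = [l + 0.5 * rng.gauss(0, 1) for l in low_level]
late_voxel = [s + 0.5 * rng.gauss(0, 1) for s in semantic]

# Intervention: strip the low-level feature out of the representation.
residual_rep = residualize(model_rep, low_level)

# Alignment before and after the intervention.
before_early = pearson(model_rep, early_voxel)
after_early = pearson(residual_rep, early_voxel)
before_late = pearson(model_rep, late_voxel)
after_late = pearson(residual_rep, late_voxel)

print(f"early voxel: {before_early:.2f} -> {after_early:.2f}")
print(f"late voxel:  {before_late:.2f} -> {after_late:.2f}")
```

In this toy setup the residual representation loses its alignment with the early voxel but keeps its alignment with the late voxel, which mirrors the pattern the abstract reports for text-based models. A representation whose late-region alignment also collapsed after the intervention would correspond to the pattern reported for speech-based models. The synthetic numbers say nothing about real fMRI data.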

Comments:	23 pages, 16 figures
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS); Neurons and Cognition (q-bio.NC)
Cite as:	arXiv:2311.04664 [cs.CL]
	(or arXiv:2311.04664v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2311.04664

Submission history

From: Subba Reddy Oota
[v1] Wed, 8 Nov 2023 13:11:48 UTC (7,570 KB)
[v2] Sun, 16 Jun 2024 23:52:21 UTC (11,042 KB)
