Neuroformer: Multimodal and Multitask Generative Pretraining for Brain Data

Antoniades, Antonis; Yu, Yiyi; Canzano, Joseph; Wang, William; Smith, Spencer LaVere

Quantitative Biology > Neurons and Cognition

arXiv:2311.00136v4 (q-bio)

[Submitted on 31 Oct 2023 (v1), last revised 15 Mar 2024 (this version, v4)]

Title:Neuroformer: Multimodal and Multitask Generative Pretraining for Brain Data

Authors:Antonis Antoniades, Yiyi Yu, Joseph Canzano, William Wang, Spencer LaVere Smith

View PDF HTML (experimental)

Abstract:State-of-the-art systems neuroscience experiments yield large-scale multimodal data, and these data sets require new tools for analysis. Inspired by the success of large pretrained models in vision and language domains, we reframe the analysis of large-scale, cellular-resolution neuronal spiking data into an autoregressive spatiotemporal generation problem. Neuroformer is a multimodal, multitask generative pretrained transformer (GPT) model that is specifically designed to handle the intricacies of data in systems neuroscience. It scales linearly with feature size, can process an arbitrary number of modalities, and is adaptable to downstream tasks, such as predicting behavior. We first trained Neuroformer on simulated datasets, and found that it both accurately predicted simulated neuronal circuit activity, and also intrinsically inferred the underlying neural circuit connectivity, including direction. When pretrained to decode neural responses, the model predicted the behavior of a mouse with only few-shot fine-tuning, suggesting that the model begins learning how to do so directly from the neural representations themselves, without any explicit supervision. We used an ablation study to show that joint training on neuronal responses and behavior boosted performance, highlighting the model's ability to associate behavioral and neural representations in an unsupervised manner. These findings show that Neuroformer can analyze neural datasets and their emergent properties, informing the development of models and hypotheses associated with the brain.

Comments:	9 pages for main paper. 22 pages in total. 13 figures, 1 table
Subjects:	Neurons and Cognition (q-bio.NC); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
Cite as:	arXiv:2311.00136 [q-bio.NC]
	(or arXiv:2311.00136v4 [q-bio.NC] for this version)
	https://doi.org/10.48550/arXiv.2311.00136

Submission history

From: Antonis Antoniades [view email]
[v1] Tue, 31 Oct 2023 20:17:32 UTC (8,756 KB)
[v2] Mon, 6 Nov 2023 21:18:26 UTC (8,756 KB)
[v3] Wed, 8 Nov 2023 19:48:12 UTC (10,154 KB)
[v4] Fri, 15 Mar 2024 22:07:06 UTC (11,468 KB)

Quantitative Biology > Neurons and Cognition

Title:Neuroformer: Multimodal and Multitask Generative Pretraining for Brain Data

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Quantitative Biology > Neurons and Cognition

Title:Neuroformer: Multimodal and Multitask Generative Pretraining for Brain Data

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators