A recurrent vision transformer shows signatures of primate visual attention

Morgan, Jonathan; Albanna, Badr; Herman, James P.

arXiv:2502.10955 (cs)

[Submitted on 16 Feb 2025]

Abstract: Attention is fundamental to both biological and artificial intelligence, yet research on animal attention and AI self-attention remains largely disconnected. We propose a Recurrent Vision Transformer (Recurrent ViT) that integrates self-attention with recurrent memory, allowing both current inputs and stored information to guide attention allocation. Trained solely via sparse reward feedback on a spatially cued orientation change detection task, a paradigm used in primate studies, our model exhibits primate-like signatures of attention, including improved accuracy and faster responses for cued stimuli that scale with cue validity. Analysis of self-attention maps reveals dynamic spatial prioritization with reactivation prior to expected changes, and targeted perturbations produce performance shifts similar to those observed in primate frontal eye fields and superior colliculus. These findings demonstrate that incorporating recurrent feedback into self-attention can capture key aspects of primate visual attention.
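The idea of letting both current inputs and stored information guide attention allocation can be illustrated with a small sketch. This is not the authors' architecture: it assumes identity projections (no learned weights), a single attention head, and a hypothetical scheme in which memory tokens are simply appended to the patch tokens and carried across time steps. The names `self_attention` and `recurrent_step` are illustrative only.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(tokens):
    """Scaled dot-product self-attention with identity projections.

    Assumption: queries, keys and values are the tokens themselves;
    a real model would apply learned linear maps first.
    """
    dim = len(tokens[0])
    outputs, weights = [], []
    for query in tokens:
        scores = [
            sum(q * k for q, k in zip(query, key)) / math.sqrt(dim)
            for key in tokens
        ]
        row = softmax(scores)
        outputs.append(
            [sum(w * value[d] for w, value in zip(row, tokens)) for d in range(dim)]
        )
        weights.append(row)
    return outputs, weights


def recurrent_step(patch_tokens, memory_tokens):
    """One time step: attend jointly over current patches and stored memory.

    The outputs at the memory positions become the memory for the next
    step, so earlier inputs can influence later attention maps.
    """
    tokens = patch_tokens + memory_tokens
    outputs, weights = self_attention(tokens)
    n_patches = len(patch_tokens)
    return outputs[:n_patches], outputs[n_patches:], weights


# Toy usage: two 2-D patch tokens and one memory token, run for three steps.
patches = [[1.0, 0.0], [0.0, 1.0]]
memory = [[0.0, 0.0]]
for _ in range(3):
    patch_out, memory, weights = recurrent_step(patches, memory)
```

Each row of `weights` is an attention map over the patch and memory tokens combined, which is the kind of quantity the abstract describes analyzing, although the maps in the paper come from a trained model rather than this fixed toy.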

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Neurons and Cognition (q-bio.NC)
Cite as:	arXiv:2502.10955 [cs.CV]
	(or arXiv:2502.10955v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2502.10955

Submission history

From: Jonathan Morgan
[v1] Sun, 16 Feb 2025 02:22:27 UTC (41,439 KB)
