A Spatial-channel-temporal-fused Attention for Spiking Neural Networks

Cai, Wuque; Sun, Hongze; Liu, Rui; Cui, Yan; Wang, Jun; Xia, Yang; Yao, Dezhong; Guo, Daqing

arXiv:2209.10837 (cs)

[Submitted on 22 Sep 2022 (v1), last revised 28 May 2023 (this version, v3)]

Abstract: Spiking neural networks (SNNs) mimic the brain's computational strategies and exhibit substantial capabilities in spatiotemporal information processing. As an essential factor in human perception, visual attention refers to the dynamic process of selecting salient regions in biological vision systems. Although visual attention mechanisms have achieved great success in computer vision applications, they are rarely introduced into SNNs. Inspired by experimental observations of predictive attentional remapping, we propose in the present study a new spatial-channel-temporal-fused attention (SCTFA) module that guides SNNs to efficiently capture underlying target regions by utilizing accumulated historical spatial-channel information. Through a systematic evaluation on three event stream datasets (DVS Gesture, SL-Animals-DVS and MNIST-DVS), we demonstrate that the SNN with the SCTFA module (SCTFA-SNN) not only significantly outperforms the baseline SNN (BL-SNN) and two other SNN models with degenerated attention modules, but also achieves accuracy competitive with existing state-of-the-art methods. Additionally, our detailed analysis shows that the proposed SCTFA-SNN model is strongly robust to noise and remains stable when faced with incomplete data, while maintaining acceptable complexity and efficiency. Overall, these findings indicate that incorporating appropriate cognitive mechanisms of the brain may provide a promising approach to elevating the capabilities of SNNs.

Comments:	14 pages, 9 figures, 5 tables; This work has been submitted to the IEEE for possible publication
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE); Neurons and Cognition (q-bio.NC)
Cite as:	arXiv:2209.10837 [cs.CV]
	(or arXiv:2209.10837v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2209.10837

Submission history

From: Daqing Guo
[v1] Thu, 22 Sep 2022 07:45:55 UTC (7,250 KB)
[v2] Wed, 17 May 2023 10:12:35 UTC (10,188 KB)
[v3] Sun, 28 May 2023 09:44:32 UTC (10,189 KB)
