Vision Mamba for Classification of Breast Ultrasound Images

Nasiri-Sarvi, Ali; Hosseini, Mahdi S.; Rivaz, Hassan

arXiv:2407.03552 (cs)

[Submitted on 4 Jul 2024 (v1), last revised 17 Sep 2024 (this version, v2)]

Abstract: Mamba-based models, VMamba and Vim, are a recent family of vision encoders that offer promising performance improvements in many computer vision tasks. This paper compares Mamba-based models with traditional Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs) on the breast ultrasound BUSI dataset and the Breast Ultrasound B dataset. Our evaluation, which includes multiple experimental runs and statistical significance analysis, shows that some Mamba-based architectures often outperform CNN and ViT models with statistically significant results. For example, on the B dataset, the best Mamba-based models improve average AUC by 1.98% and average accuracy by 5.0% over the best non-Mamba-based model in this study. These Mamba-based models effectively capture long-range dependencies while maintaining some inductive biases, making them suitable for applications with limited data. The code is available at this https URL.
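
The abstract mentions multiple runs and a statistical significance analysis but does not say which test was used. As a rough illustration only, the sketch below compares per-run AUC values from two models with an exact two-sided permutation test on the difference of means. The choice of test, the function name `exact_permutation_pvalue`, and the AUC values are all assumptions made for this example; they are not the authors' method or results.

```python
from itertools import combinations
from statistics import mean


def exact_permutation_pvalue(a, b):
    """Two-sided exact permutation test on the difference of group means.

    Hypothetical helper for illustration; not taken from the paper's code.
    """
    pooled = list(a) + list(b)
    indices = range(len(pooled))
    observed = abs(mean(a) - mean(b))
    extreme = 0
    total = 0
    # Enumerate every way of relabelling the pooled runs into two groups
    # of the original sizes, and count splits at least as extreme as observed.
    for chosen in combinations(indices, len(a)):
        chosen_set = set(chosen)
        group_a = [pooled[i] for i in indices if i in chosen_set]
        group_b = [pooled[i] for i in indices if i not in chosen_set]
        if abs(mean(group_a) - mean(group_b)) >= observed - 1e-12:
            extreme += 1
        total += 1
    return extreme / total


# Hypothetical per-run AUC values, NOT results reported in the paper.
mamba_auc = [0.95, 0.96, 0.94, 0.97, 0.95]
baseline_auc = [0.90, 0.91, 0.89, 0.92, 0.90]

p_value = exact_permutation_pvalue(mamba_auc, baseline_auc)
diff = mean(mamba_auc) - mean(baseline_auc)
print(f"mean AUC difference = {diff:.3f}, p = {p_value:.4f}")
```

With only a handful of runs per model, an exact permutation test avoids distributional assumptions, but the smallest attainable p-value is bounded by the number of possible relabellings, so more runs give finer resolution.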

Comments:	Accepted in MICCAI 2024 Deep-Breath workshop
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2407.03552 [cs.CV]
	(or arXiv:2407.03552v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2407.03552

Submission history

From: Mahdi S. Hosseini
[v1] Thu, 4 Jul 2024 00:21:47 UTC (331 KB)
[v2] Tue, 17 Sep 2024 04:37:16 UTC (332 KB)
