Merging Context Clustering with Visual State Space Models for Medical Image Segmentation

Zhu, Yun; Zhang, Dong; Lin, Yi; Feng, Yifei; Tang, Jinhui

Computer Science > Computer Vision and Pattern Recognition

arXiv:2501.01618 (cs)

[Submitted on 3 Jan 2025]

Title:Merging Context Clustering with Visual State Space Models for Medical Image Segmentation

Authors:Yun Zhu, Dong Zhang, Yi Lin, Yifei Feng, Jinhui Tang

View PDF HTML (experimental)

Abstract:Medical image segmentation demands the aggregation of global and local feature representations, posing a challenge for current methodologies in handling both long-range and short-range feature interactions. Recently, vision mamba (ViM) models have emerged as promising solutions for addressing model complexities by excelling in long-range feature iterations with linear complexity. However, existing ViM approaches overlook the importance of preserving short-range local dependencies by directly flattening spatial tokens and are constrained by fixed scanning patterns that limit the capture of dynamic spatial context information. To address these challenges, we introduce a simple yet effective method named context clustering ViM (CCViM), which incorporates a context clustering module within the existing ViM models to segment image tokens into distinct windows for adaptable local clustering. Our method effectively combines long-range and short-range feature interactions, thereby enhancing spatial contextual representations for medical image segmentation tasks. Extensive experimental evaluations on diverse public datasets, i.e., Kumar, CPM17, ISIC17, ISIC18, and Synapse demonstrate the superior performance of our method compared to current state-of-the-art methods. Our code can be found at this https URL.

Comments:	Our paper has been accepted by the IEEE Transactions on Medical Imaging. Our code can be found at this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2501.01618 [cs.CV]
	(or arXiv:2501.01618v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2501.01618

Submission history

From: Yun Zhu [view email]
[v1] Fri, 3 Jan 2025 03:25:30 UTC (10,391 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Merging Context Clustering with Visual State Space Models for Medical Image Segmentation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Merging Context Clustering with Visual State Space Models for Medical Image Segmentation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators