Spatial-Frequency Dual Progressive Attention Network For Medical Image Segmentation

Zhou, Zhenhuan; He, Along; Wu, Yanlin; Yao, Rui; Xie, Xueshuo; Li, Tao

arXiv:2406.07952 (eess)

[Submitted on 12 Jun 2024 (v1), last revised 19 Aug 2024 (this version, v2)]

Abstract: In medical images, different types of lesions often differ significantly in shape and texture. Accurate medical image segmentation therefore demands deep learning models with robust multi-scale and boundary feature learning. Previous networks remain limited on both counts. First, they typically fuse multi-level features simultaneously or employ deep supervision to enhance multi-scale learning, which can lead to feature redundancy and excessive computational overhead and hinders both network training and clinical deployment. Second, most medical image segmentation networks learn features exclusively in the spatial domain and disregard the abundant global information in the frequency domain. This biases them towards low-frequency components and causes them to neglect crucial high-frequency information. To address these problems, we introduce SF-UNet, a spatial-frequency dual-domain attention network. It comprises two main components: the Multi-scale Progressive Channel Attention (MPCA) block, which progressively extracts multi-scale features across adjacent encoder layers, and the lightweight Frequency-Spatial Attention (FSA) block, which has only 0.05M parameters and enables concurrent learning of texture and boundary features from both the spatial and frequency domains. We validate the effectiveness of the proposed SF-UNet on three public datasets. Experimental results show that SF-UNet outperforms previous state-of-the-art (SOTA) medical image segmentation networks, improving DSC and IoU by up to 9.4% and 10.78%, respectively. Code will be released at this https URL.
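For readers unfamiliar with the two metrics named in the abstract, the following is a minimal sketch of how the Dice similarity coefficient (DSC) and intersection over union (IoU) are commonly computed for a pair of binary masks. It is not taken from the SF-UNet code. The function name `dice_and_iou`, the nested-list mask format, and the convention that two empty masks score 1.0 are assumptions made for illustration only.

```python
def dice_and_iou(pred, target):
    """Return (DSC, IoU) for two binary masks given as nested lists of 0/1.

    Hypothetical helper for illustration; not part of the SF-UNet release.
    """
    # Flatten both masks and coerce every pixel to 0 or 1.
    p = [int(bool(v)) for row in pred for v in row]
    t = [int(bool(v)) for row in target for v in row]
    if len(p) != len(t):
        raise ValueError("masks must contain the same number of pixels")

    inter = sum(a & b for a, b in zip(p, t))  # pixels that are 1 in both masks
    p_sum, t_sum = sum(p), sum(t)             # foreground size of each mask
    union = p_sum + t_sum - inter             # pixels that are 1 in either mask

    # Assumed convention: two empty masks count as a perfect match.
    dsc = 2 * inter / (p_sum + t_sum) if (p_sum + t_sum) else 1.0
    iou = inter / union if union else 1.0
    return dsc, iou


# Toy 2x3 masks: 3 predicted pixels, 3 ground-truth pixels, 2 shared.
pred = [[1, 1, 0],
        [0, 1, 0]]
target = [[1, 0, 0],
          [0, 1, 1]]
dsc, iou = dice_and_iou(pred, target)
print(f"DSC = {dsc:.4f}, IoU = {iou:.4f}")  # DSC = 0.6667, IoU = 0.5000
```

For a single mask pair the two scores are related by DSC = 2·IoU / (1 + IoU), so they always rank pairs in the same order, although the size of an improvement generally differs between them.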

Comments:	6 pages; accepted by the 2024 IEEE International Conference on Bioinformatics and Biomedicine (BIBM 2024)
Subjects:	Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2406.07952 [eess.IV]
	(or arXiv:2406.07952v2 [eess.IV] for this version)
	https://doi.org/10.48550/arXiv.2406.07952

Submission history

From: Zhenhuan Zhou
[v1] Wed, 12 Jun 2024 07:22:05 UTC (2,937 KB)
[v2] Mon, 19 Aug 2024 14:56:05 UTC (2,935 KB)
