MM-UNet: A Mixed MLP Architecture for Improved Ophthalmic Image Segmentation

Xiao, Zunjie; Zhang, Xiaoqing; Higashita, Risa; Liu, Jiang

arXiv:2408.08600 (cs)

[Submitted on 16 Aug 2024]

Abstract: Ophthalmic image segmentation serves as a critical foundation for ocular disease diagnosis. Although fully convolutional neural networks (CNNs) are commonly employed for segmentation, they are constrained by inductive biases and face challenges in establishing long-range dependencies. Transformer-based models address these limitations but introduce substantial computational overhead. Recently, a simple yet efficient Multilayer Perceptron (MLP) architecture was proposed for image classification, achieving competitive performance relative to advanced transformers. However, its effectiveness for ophthalmic image segmentation remains unexplored. In this paper, we introduce MM-UNet, an efficient Mixed MLP model tailored for ophthalmic image segmentation. Within MM-UNet, we propose a multi-scale MLP (MMLP) module that facilitates the interaction of features at various depths through a grouping strategy, enabling simultaneous capture of global and local information. We conducted extensive experiments on both a private anterior segment optical coherence tomography (AS-OCT) image dataset and a public fundus image dataset. The results demonstrated the superiority of our MM-UNet model in comparison to state-of-the-art deep segmentation networks.

Comments:	OMIA2024
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2408.08600 [cs.CV]
	(or arXiv:2408.08600v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2408.08600

Submission history

From: Zunjie Xiao
[v1] Fri, 16 Aug 2024 08:34:50 UTC (1,526 KB)
