Advancing Volumetric Medical Image Segmentation via Global-Local Masked Autoencoder

Zhuang, Jia-Xin; Luo, Luyang; Chen, Hao

arXiv:2306.08913 (cs)

[Submitted on 15 Jun 2023 (v1), last revised 23 Aug 2023 (this version, v2)]

Abstract: Masked autoencoder (MAE) is a promising self-supervised pre-training technique that can improve the representation learning of a neural network without human intervention. However, applying MAE directly to volumetric medical images poses two challenges: (i) a lack of global information that is crucial for understanding the clinical context of the holistic data, and (ii) no guarantee of stabilizing the representations learned from randomly masked inputs. To address these limitations, we propose the Global-Local Masked AutoEncoder (GL-MAE), a simple yet effective self-supervised pre-training strategy. In addition to reconstructing masked local views, as in previous methods, GL-MAE incorporates global context learning by reconstructing masked global views. Furthermore, a complete global view is integrated as an anchor to guide the reconstruction and stabilize the learning process through global-to-global consistency learning and global-to-local consistency learning. Fine-tuning results on multiple datasets demonstrate the superiority of our method over other state-of-the-art self-supervised algorithms, highlighting its effectiveness on versatile volumetric medical image segmentation tasks, even when annotations are scarce. Our code and models will be released upon acceptance.
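
As a rough illustration of the inputs the abstract describes, the sketch below builds a complete global view (the anchor), a masked global view, and a masked local view from one 3D volume. It is a minimal sketch and not the authors' implementation: the function names, crop sizes, patch size, mask ratio, and the use of zero-filling for masked patches are all assumptions chosen for illustration, and the reconstruction and consistency losses are not shown because the abstract does not specify them.

```python
import numpy as np


def random_crop(volume, size, rng):
    """Cut a random cubic sub-volume with side length `size` (hypothetical helper)."""
    d, h, w = (rng.integers(0, dim - size + 1) for dim in volume.shape)
    return volume[d:d + size, h:h + size, w:w + size]


def mask_patches(view, patch, ratio, rng):
    """Zero out a random `ratio` of non-overlapping cubic patches.

    Assumes every side of `view` is divisible by `patch`.
    Returns the masked view and a flat boolean array marking hidden patches.
    """
    grid = tuple(s // patch for s in view.shape)
    n_patches = int(np.prod(grid))
    n_masked = int(round(ratio * n_patches))
    hidden = np.zeros(n_patches, dtype=bool)
    hidden[rng.choice(n_patches, size=n_masked, replace=False)] = True
    # Expand the patch-level mask to voxel resolution.
    voxel_mask = hidden.reshape(grid)
    for axis in range(3):
        voxel_mask = np.repeat(voxel_mask, patch, axis=axis)
    return np.where(voxel_mask, 0.0, view), hidden


def make_views(volume, global_size=64, local_size=32, patch=8, ratio=0.75, seed=0):
    """Build anchor, masked global, and masked local views (all sizes are assumed)."""
    rng = np.random.default_rng(seed)
    anchor = random_crop(volume, global_size, rng)  # complete, unmasked global view
    local = random_crop(volume, local_size, rng)
    masked_global, global_hidden = mask_patches(anchor, patch, ratio, rng)
    masked_local, local_hidden = mask_patches(local, patch, ratio, rng)
    return {
        "anchor": anchor,
        "masked_global": masked_global,
        "masked_local": masked_local,
        "global_hidden": global_hidden,
        "local_hidden": local_hidden,
    }


volume = np.random.default_rng(1).random((96, 96, 96))  # stand-in for a CT or MRI volume
views = make_views(volume)
```

In a pre-training loop, an encoder-decoder would reconstruct the hidden patches of `masked_global` and `masked_local`, while features of `anchor` would serve as the target for the two consistency terms; how those terms are weighted is left open here.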

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2306.08913 [cs.CV]
	(or arXiv:2306.08913v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2306.08913

Submission history

From: Jia-Xin Zhuang
[v1] Thu, 15 Jun 2023 07:32:10 UTC (3,760 KB)
[v2] Wed, 23 Aug 2023 16:07:52 UTC (5,727 KB)
