Self Pre-training with Adaptive Mask Autoencoders for Variable-Contrast 3D Medical Imaging

Das, Badhan Kumar; Zhao, Gengyan; Liu, Han; Re, Thomas J.; Comaniciu, Dorin; Gibson, Eli; Maier, Andreas

Electrical Engineering and Systems Science > Image and Video Processing

arXiv:2501.09096 (eess)

[Submitted on 15 Jan 2025]

Title:Self Pre-training with Adaptive Mask Autoencoders for Variable-Contrast 3D Medical Imaging

Authors:Badhan Kumar Das, Gengyan Zhao, Han Liu, Thomas J. Re, Dorin Comaniciu, Eli Gibson, Andreas Maier

View PDF HTML (experimental)

Abstract:The Masked Autoencoder (MAE) has recently demonstrated effectiveness in pre-training Vision Transformers (ViT) for analyzing natural images. By reconstructing complete images from partially masked inputs, the ViT encoder gathers contextual information to predict the missing regions. This capability to aggregate context is especially important in medical imaging, where anatomical structures are functionally and mechanically linked to surrounding regions. However, current methods do not consider variations in the number of input images, which is typically the case in real-world Magnetic Resonance (MR) studies. To address this limitation, we propose a 3D Adaptive Masked Autoencoders (AMAE) architecture that accommodates a variable number of 3D input contrasts per subject. A magnetic resonance imaging (MRI) dataset of 45,364 subjects was used for pretraining and a subset of 1648 training, 193 validation and 215 test subjects were used for finetuning. The performance demonstrates that self pre-training of this adaptive masked autoencoders can enhance the infarct segmentation performance by 2.8%-3.7% for ViT-based segmentation models.

Comments:	5 pages, ISBI 2025 accepted
Subjects:	Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2501.09096 [eess.IV]
	(or arXiv:2501.09096v1 [eess.IV] for this version)
	https://doi.org/10.48550/arXiv.2501.09096

Submission history

From: Badhan Kumar Das [view email]
[v1] Wed, 15 Jan 2025 19:29:31 UTC (1,056 KB)

Electrical Engineering and Systems Science > Image and Video Processing

Title:Self Pre-training with Adaptive Mask Autoencoders for Variable-Contrast 3D Medical Imaging

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Image and Video Processing

Title:Self Pre-training with Adaptive Mask Autoencoders for Variable-Contrast 3D Medical Imaging

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators