DailyMAE: Towards Pretraining Masked Autoencoders in One Day

Wu, Jiantao; Mo, Shentong; Atito, Sara; Feng, Zhenhua; Kittler, Josef; Awais, Muhammad


arXiv:2404.00509 (cs)

[Submitted on 31 Mar 2024]



Abstract: Recently, masked image modeling (MIM), an important self-supervised learning (SSL) method, has drawn attention for its effectiveness in learning data representations from unlabeled data. Numerous studies underscore the advantages of MIM, highlighting how models pretrained on extensive datasets can enhance the performance of downstream tasks. However, the high computational demands of pretraining pose significant challenges, particularly within academic environments, thereby impeding progress in SSL research. In this study, we propose efficient training recipes for MIM-based SSL that focus on mitigating data loading bottlenecks and employ progressive training techniques and other tricks to closely maintain pretraining performance. Our library enables the training of an MAE-Base/16 model on the ImageNet-1K dataset for 800 epochs within just 18 hours, using a single machine equipped with 8 A100 GPUs. By achieving speed gains of up to 5.8 times, this work not only demonstrates the feasibility of conducting high-efficiency SSL training but also paves the way for broader accessibility and promotes advancement in SSL research, particularly for prototyping and initial testing of SSL ideas. The code is available in this https URL.
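
To put the headline figures in perspective, the short sketch below works out the data throughput they imply. It is a back-of-the-envelope calculation, not something taken from the paper: it assumes the standard ImageNet-1K training split of 1,281,167 images (the abstract does not state the split size) and that every epoch is one full pass over that split. The function name `implied_throughput` is purely illustrative.

```python
# Back-of-the-envelope throughput implied by the abstract's numbers.
# ASSUMPTION: the standard ImageNet-1K training split (1,281,167 images) is used,
# and each epoch processes every image exactly once.
IMAGENET_1K_TRAIN_IMAGES = 1_281_167
EPOCHS = 800           # from the abstract
WALL_CLOCK_HOURS = 18  # from the abstract
NUM_GPUS = 8           # from the abstract (single machine, 8 A100 GPUs)


def implied_throughput(images: int, epochs: int, hours: float, gpus: int):
    """Return (aggregate images/s, images/s per GPU) for a training run."""
    total_images = images * epochs
    seconds = hours * 3600
    aggregate = total_images / seconds
    return aggregate, aggregate / gpus


aggregate, per_gpu = implied_throughput(
    IMAGENET_1K_TRAIN_IMAGES, EPOCHS, WALL_CLOCK_HOURS, NUM_GPUS
)
print(f"aggregate: {aggregate:,.0f} images/s")
print(f"per GPU:   {per_gpu:,.0f} images/s")
```

Under these assumptions the data pipeline has to sustain on the order of sixteen thousand images per second across the machine, which helps explain why the abstract singles out data loading as a bottleneck.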

Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2404.00509 [cs.LG]
	(or arXiv:2404.00509v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2404.00509

Submission history

From: Jiantao Wu [view email]
[v1] Sun, 31 Mar 2024 00:59:10 UTC (41,952 KB)
