FastFold: Reducing AlphaFold Training Time from 11 Days to 67 Hours

Cheng, Shenggan; Wu, Ruidong; Yu, Zhongming; Li, Binrui; Zhang, Xiwen; Peng, Jian; You, Yang

Computer Science > Machine Learning

arXiv:2203.00854v1 (cs)

[Submitted on 2 Mar 2022 (this version), latest version 5 Feb 2023 (v3)]

Title:FastFold: Reducing AlphaFold Training Time from 11 Days to 67 Hours

Authors:Shenggan Cheng, Ruidong Wu, Zhongming Yu, Binrui Li, Xiwen Zhang, Jian Peng, Yang You

View PDF

Abstract:Protein structure prediction is an important method for understanding gene translation and protein function in the domain of structural biology. AlphaFold introduced the Transformer model to the field of protein structure prediction with atomic accuracy. However, training and inference of the AlphaFold model are time-consuming and expensive because of the special performance characteristics and huge memory consumption. In this paper, we propose FastFold, a highly efficient implementation of protein structure prediction model for training and inference. FastFold includes a series of GPU optimizations based on a thorough analysis of AlphaFold's performance. Meanwhile, with \textit{Dynamic Axial Parallelism} and \textit{Duality Async Operation}, FastFold achieves high model parallelism scaling efficiency, surpassing existing popular model parallelism techniques. Experimental results show that FastFold reduces overall training time from 11 days to 67 hours and achieves $7.5\sim9.5\times$ speedup for long-sequence inference. Furthermore, We scaled FastFold to 512 GPUs and achieved an aggregate of 6.02 PetaFLOPs with 90.1\% parallel efficiency. The implementation can be found at this https URL.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC); Quantitative Methods (q-bio.QM)
Cite as:	arXiv:2203.00854 [cs.LG]
	(or arXiv:2203.00854v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2203.00854

Submission history

From: Yang You [view email]
[v1] Wed, 2 Mar 2022 03:59:51 UTC (1,674 KB)
[v2] Fri, 4 Mar 2022 10:08:04 UTC (1,674 KB)
[v3] Sun, 5 Feb 2023 13:31:06 UTC (7,299 KB)

Computer Science > Machine Learning

Title:FastFold: Reducing AlphaFold Training Time from 11 Days to 67 Hours

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:FastFold: Reducing AlphaFold Training Time from 11 Days to 67 Hours

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators