PaSE: Parallelization Strategies for Efficient DNN Training

Elango, Venmugil

doi:10.1109/IPDPS49936.2021.00111

Computer Science > Machine Learning

arXiv:2407.04001 (cs)

[Submitted on 4 Jul 2024]

Title:PaSE: Parallelization Strategies for Efficient DNN Training

Authors:Venmugil Elango

View PDF HTML (experimental)

Abstract:Training a deep neural network (DNN) requires substantial computational and memory requirements. It is common to use multiple devices to train a DNN to reduce the overall training time. There are several choices to parallelize each layer in a DNN. Exhaustively searching this list to find an optimal parallelization strategy is prohibitively time consuming and impractical. The standard practice is to use data parallelism because of its simplicity. However, data parallelism is often sub-optimal, and suffers from poor performance and high memory requirement. Expert-designed strategies have been proposed on a case-by-case basis using domain specific knowledge. These expert-designed strategies do not generalize well to DNNs other than the ones for which they were designed, and are not always necessarily the best choice.
In this paper, we propose an approach to automatically find efficient parallelization strategies for DNNs from their computation graphs. We present an efficient algorithm to compute these strategies within a reasonable time in practice. We evaluate the effectiveness of our approach on various DNNs. We also compare the performance of the strategies identified by our approach against data parallelism, expert-designed strategies, and the state-of-the-art approaches. Our results show that the strategies found using our approach outperform the baseline data parallelism strategy in all the cases. In addition, our strategies achieve better performance than the expert-designed strategies and the state-of-the-art approaches.

Comments:	Published as conference paper at IPDPS 2021
Subjects:	Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
Cite as:	arXiv:2407.04001 [cs.LG]
	(or arXiv:2407.04001v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2407.04001
Journal reference:	2021 IEEE International Parallel and Distributed Processing Symposium (IPDPS), Portland, OR, USA, 2021, pp. 1025-1034
Related DOI:	https://doi.org/10.1109/IPDPS49936.2021.00111

Submission history

From: Venmugil Elango [view email]
[v1] Thu, 4 Jul 2024 15:21:20 UTC (233 KB)

Computer Science > Machine Learning

Title:PaSE: Parallelization Strategies for Efficient DNN Training

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:PaSE: Parallelization Strategies for Efficient DNN Training

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators