Memory-Efficient Pipeline-Parallel DNN Training

Narayanan, Deepak; Phanishayee, Amar; Shi, Kaiyu; Chen, Xie; Zaharia, Matei

arXiv:2006.09503v1 (cs)

[Submitted on 16 Jun 2020 (this version), latest version 22 Jul 2021 (v3)]

Abstract: Many state-of-the-art results in domains such as NLP and computer vision have been obtained by scaling up the number of parameters in existing models. However, the weight parameters and intermediate outputs of these large models often do not fit in the main memory of a single accelerator device; this means that it is necessary to use multiple accelerators to train large models, which is challenging to do in a time-efficient way. In this work, we propose PipeDream-2BW, a system that performs memory-efficient pipeline parallelism, a hybrid form of parallelism that combines data and model parallelism with input pipelining. Our system uses a novel pipelining and weight gradient coalescing strategy, combined with the double buffering of weights, to ensure high throughput, low memory footprint, and weight update semantics similar to data parallelism. In addition, PipeDream-2BW automatically partitions the model over the available hardware resources, while being cognizant of constraints such as compute capabilities, memory capacities, and interconnect topologies, and determines when to employ existing memory-saving techniques, such as activation recomputation, that trade off extra computation for lower memory footprint. PipeDream-2BW is able to accelerate the training of large language models with up to 2.5 billion parameters by up to 6.9x compared to optimized baselines.

Subjects:	Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (stat.ML)
Cite as:	arXiv:2006.09503 [cs.LG]
	(or arXiv:2006.09503v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2006.09503

Submission history

From: Deepak Narayanan
[v1] Tue, 16 Jun 2020 20:33:54 UTC (1,049 KB)
[v2] Thu, 18 Feb 2021 05:01:32 UTC (563 KB)
[v3] Thu, 22 Jul 2021 17:25:58 UTC (2,360 KB)
