Unifying Data, Model and Hybrid Parallelism in Deep Learning via Tensor Tiling

Wang, Minjie; Huang, Chien-chin; Li, Jinyang

Computer Science > Distributed, Parallel, and Cluster Computing

arXiv:1805.04170 (cs)

[Submitted on 10 May 2018]

Title:Unifying Data, Model and Hybrid Parallelism in Deep Learning via Tensor Tiling

Authors:Minjie Wang, Chien-chin Huang, Jinyang Li

View PDF

Abstract:Deep learning systems have become vital tools across many fields, but the increasing model sizes mean that training must be accelerated to maintain such systems' utility. Current systems like Tensorflow and MXNet focus on one specific parallelization strategy, data parallelism, which requires large training batch sizes in order to scale. We cast the problem of finding the best parallelization strategy as the problem of finding the best tiling to partition tensors with the least overall communication. We propose an algorithm that can find the optimal tiling. Our resulting parallelization solution is a hybrid of data parallelism and model parallelism. We build the SoyBean system that performs automatic parallelization. SoyBean automatically transforms a serial dataflow graph captured by an existing deep learning system frontend into a parallel dataflow graph based on the optimal tiling it has found. Our evaluations show that SoyBean is 1.5x-4x faster than pure data parallelism for AlexNet and VGG. We present this automatic tiling in a new system, SoyBean, that can act as a backend for Tensorflow, MXNet, and others.

Subjects:	Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
Cite as:	arXiv:1805.04170 [cs.DC]
	(or arXiv:1805.04170v1 [cs.DC] for this version)
	https://doi.org/10.48550/arXiv.1805.04170

Submission history

From: Minjie Wang [view email]
[v1] Thu, 10 May 2018 20:38:56 UTC (1,353 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.DC

< prev | next >

new | recent | 2018-05

Change to browse by:

cs
cs.LG

References & Citations

DBLP - CS Bibliography

listing | bibtex

Minjie Wang
Chien-chin Huang
Jinyang Li

export BibTeX citation

Computer Science > Distributed, Parallel, and Cluster Computing

Title:Unifying Data, Model and Hybrid Parallelism in Deep Learning via Tensor Tiling

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Distributed, Parallel, and Cluster Computing

Title:Unifying Data, Model and Hybrid Parallelism in Deep Learning via Tensor Tiling

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators