High-Performance Distributed ML at Scale through Parameter Server Consistency Models

Dai, Wei; Kumar, Abhimanu; Wei, Jinliang; Ho, Qirong; Gibson, Garth; Xing, Eric P.

Computer Science > Machine Learning

arXiv:1410.8043 (cs)

[Submitted on 29 Oct 2014]

Title:High-Performance Distributed ML at Scale through Parameter Server Consistency Models

Authors:Wei Dai, Abhimanu Kumar, Jinliang Wei, Qirong Ho, Garth Gibson, Eric P. Xing

View PDF

Abstract:As Machine Learning (ML) applications increase in data size and model complexity, practitioners turn to distributed clusters to satisfy the increased computational and memory demands. Unfortunately, effective use of clusters for ML requires considerable expertise in writing distributed code, while highly-abstracted frameworks like Hadoop have not, in practice, approached the performance seen in specialized ML implementations. The recent Parameter Server (PS) paradigm is a middle ground between these extremes, allowing easy conversion of single-machine parallel ML applications into distributed ones, while maintaining high throughput through relaxed "consistency models" that allow inconsistent parameter reads. However, due to insufficient theoretical study, it is not clear which of these consistency models can really ensure correct ML algorithm output; at the same time, there remain many theoretically-motivated but undiscovered opportunities to maximize computational throughput. Motivated by this challenge, we study both the theoretical guarantees and empirical behavior of iterative-convergent ML algorithms in existing PS consistency models. We then use the gleaned insights to improve a consistency model using an "eager" PS communication mechanism, and implement it as a new PS system that enables ML algorithms to reach their solution more quickly.

Comments:	19 pages, 2 figures
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1410.8043 [cs.LG]
	(or arXiv:1410.8043v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1410.8043

Submission history

From: Wei Dai [view email]
[v1] Wed, 29 Oct 2014 16:19:21 UTC (2,446 KB)

Full-text links:

Access Paper:

view license

Current browse context:

stat

< prev | next >

new | recent | 2014-10

Change to browse by:

cs
cs.LG
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Wei Dai
Abhimanu Kumar
Jinliang Wei
Qirong Ho
Garth A. Gibson

…

export BibTeX citation

Computer Science > Machine Learning

Title:High-Performance Distributed ML at Scale through Parameter Server Consistency Models

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:High-Performance Distributed ML at Scale through Parameter Server Consistency Models

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators