Towards Self-Tuning Parameter Servers

Liu, Chris; Zhang, Pengfei; Tang, Bo; Shen, Hang; Zhu, Lei; Lai, Ziliang; Lo, Eric

arXiv:1810.02935v1 (cs)

[Submitted on 6 Oct 2018 (this version), latest version 4 Aug 2020 (v2)]

Title: Towards Self-Tuning Parameter Servers

Authors: Chris Liu, Pengfei Zhang, Bo Tang, Hang Shen, Lei Zhu, Ziliang Lai, Eric Lo

Abstract: In recent years, advances in many applications have been driven by the use of Machine Learning (ML). Nowadays, it is common to see industrial-strength machine learning jobs that involve millions of model parameters, terabytes of training data, and weeks of training. Good efficiency, i.e., fast completion time of a specific ML job, is therefore a key feature of a successful ML system. While the completion time of a long-running ML job is determined by the time required to reach model convergence, in practice it is also largely influenced by the values of various system settings. In this paper, we contribute techniques towards building self-tuning parameter servers. Parameter Server (PS) is a popular system architecture for large-scale machine learning systems; by self-tuning we mean that, while a long-running ML job is iteratively training the expert-suggested model, the system is also iteratively learning which system setting is more efficient for that job and applying it online. While our techniques are general enough to apply to various PS-style ML systems, we have prototyped them on top of TensorFlow. Experiments show that our techniques can reduce the completion times of a variety of long-running TensorFlow jobs by 1.4x to 18x.
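
The self-tuning idea in the abstract (learning which system setting is more efficient while the job keeps training, and applying it online) can be illustrated with a small sketch. This is not the paper's technique: it swaps in a plain epsilon-greedy rule, and the candidate settings (pairs of server and worker counts), the class and function names, and the simulated cost model are all hypothetical, chosen only to make the loop runnable without a real cluster.

```python
import random


class OnlineSettingTuner:
    """Epsilon-greedy choice among candidate system settings (illustrative only)."""

    def __init__(self, settings, epsilon=0.1, seed=0):
        self.settings = list(settings)
        self.epsilon = epsilon
        self.rng = random.Random(seed)
        self.total_time = {s: 0.0 for s in self.settings}
        self.count = {s: 0 for s in self.settings}

    def choose(self):
        # Try every setting once before exploiting what has been learned.
        untried = [s for s in self.settings if self.count[s] == 0]
        if untried:
            return untried[0]
        # Occasionally explore a random setting; otherwise use the best so far.
        if self.rng.random() < self.epsilon:
            return self.rng.choice(self.settings)
        return self.best()

    def best(self):
        # Lowest mean observed iteration time among settings tried so far.
        tried = [s for s in self.settings if self.count[s] > 0]
        return min(tried, key=lambda s: self.total_time[s] / self.count[s])

    def record(self, setting, seconds):
        self.total_time[setting] += seconds
        self.count[setting] += 1


def simulated_iteration_time(setting, rng):
    """Stand-in for timing one training iteration; the cost model is made up."""
    num_servers, num_workers = setting
    base = 10.0 / num_workers + 0.5 * num_servers + 0.2 * num_workers
    return base * rng.uniform(0.95, 1.05)  # +/-5% measurement noise


def run(iterations=200, seed=0):
    rng = random.Random(seed)
    # Hypothetical (num_servers, num_workers) candidates.
    candidates = [(1, 2), (2, 4), (2, 8), (4, 8)]
    tuner = OnlineSettingTuner(candidates, epsilon=0.1, seed=seed)
    for _ in range(iterations):
        setting = tuner.choose()  # setting applied to the next iteration
        tuner.record(setting, simulated_iteration_time(setting, rng))
    return tuner.best()
```

Calling `run()` interleaves measurement and selection in one loop, which mirrors the "tune while training" framing. A real system would also have to account for the cost of switching settings mid-job, which this sketch ignores.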

Comments:	13 pages
Subjects:	Databases (cs.DB)
Cite as:	arXiv:1810.02935 [cs.DB]
	(or arXiv:1810.02935v1 [cs.DB] for this version)
	https://doi.org/10.48550/arXiv.1810.02935

Submission history

From: Chris Liu
[v1] Sat, 6 Oct 2018 05:12:23 UTC (1,719 KB)
[v2] Tue, 4 Aug 2020 14:51:00 UTC (1,744 KB)
