Asynch-SGBDT: Asynchronous Parallel Stochastic Gradient Boosting Decision Tree based on Parameters Server

Daning, Cheng; Fen, Xia; Shigang, Li; Yunquan, Zhang

doi:10.1109/IPDPS54959.2023.00034


arXiv:1804.04659 (cs)

[Submitted on 12 Apr 2018 (v1), last revised 18 Jul 2019 (this version, v4)]




Abstract: In AI research and industry, machine learning is the most widely used tool. One of the most important machine learning algorithms is the Gradient Boosting Decision Tree (GBDT), whose training process requires considerable computational resources and time. To shorten GBDT training time, many works have tried to run GBDT on a Parameter Server. However, those GBDT algorithms are synchronous parallel algorithms, which fail to make full use of the Parameter Server. In this paper, we examine the possibility of using asynchronous parallel methods to train a GBDT model and name this algorithm asynch-SGBDT (asynchronous parallel stochastic gradient boosting decision tree). Our theoretical and experimental results indicate that the scalability of asynch-SGBDT is influenced by the sample diversity of the dataset, the sampling rate, the step length, and the settings of the GBDT trees. Experimental results also show that asynch-SGBDT training reaches a linear speedup in an asynchronous parallel manner when the dataset and the GBDT trees meet the high-scalability requirements.
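To make the abstract's terms concrete, the following is a minimal single-process sketch of stochastic gradient boosting in which each new tree is fitted to gradients read from a stale copy of the model. It is not the authors' implementation: the one-split stumps, the squared loss, the function names (`fit_stump`, `train`, `mse`) and the `delay` parameter that imitates asynchrony are all assumptions made for illustration, and no real Parameter Server is involved. The `sampling_rate` and `step_length` arguments correspond to the sampling rate and step length named in the abstract.

```python
import math
import random
from collections import deque


def mse(preds, ys):
    """Mean squared error between predictions and targets."""
    return sum((p - y) ** 2 for p, y in zip(preds, ys)) / len(ys)


def fit_stump(xs, residuals):
    """Fit a one-split regression stump to (x, residual) pairs by least squares."""
    pairs = sorted(zip(xs, residuals))
    n = len(pairs)
    total = sum(r for _, r in pairs)
    # Fallback: a constant stump, used when no valid split exists.
    best_score, thr, left, right = float("inf"), pairs[0][0], total / n, total / n
    left_sum = 0.0
    for k in range(1, n):
        left_sum += pairs[k - 1][1]
        if pairs[k][0] == pairs[k - 1][0]:
            continue  # cannot split between identical x values
        right_sum = total - left_sum
        # Minimizing the squared error is equivalent to minimizing this score.
        score = -(left_sum ** 2 / k + right_sum ** 2 / (n - k))
        if score < best_score:
            best_score = score
            thr = (pairs[k - 1][0] + pairs[k][0]) / 2
            left, right = left_sum / k, right_sum / (n - k)
    return lambda x: left if x <= thr else right


def train(xs, ys, rounds=200, sampling_rate=0.5, step_length=0.1, delay=0, seed=0):
    """Boost stumps on subsampled, possibly stale, squared-loss gradients.

    delay=0 is ordinary (synchronous) stochastic gradient boosting; delay>0
    imitates a worker that read the model `delay` updates ago (hypothetical).
    """
    rng = random.Random(seed)
    n = len(xs)
    preds = [0.0] * n
    # Keep only the last delay+1 snapshots; history[0] is the stalest one.
    history = deque([list(preds)], maxlen=delay + 1)
    sample_size = max(1, int(sampling_rate * n))
    for _ in range(rounds):
        stale = history[0]
        idx = rng.sample(range(n), sample_size)
        # Negative gradient of 1/2 * (y - F)^2, evaluated at the stale model.
        residuals = [ys[i] - stale[i] for i in idx]
        stump = fit_stump([xs[i] for i in idx], residuals)
        preds = [p + step_length * stump(x) for p, x in zip(preds, xs)]
        history.append(list(preds))
    return preds


if __name__ == "__main__":
    xs = [i / 200 for i in range(200)]
    ys = [math.sin(2 * math.pi * x) for x in xs]
    print("initial MSE:", mse([0.0] * len(ys), ys))
    print("delay=0 MSE:", mse(train(xs, ys, delay=0), ys))
    print("delay=3 MSE:", mse(train(xs, ys, delay=3), ys))
```

Varying `delay`, `sampling_rate` and `step_length` in this toy setting gives a rough feel for how staleness interacts with the factors the abstract lists, though it says nothing about the speedups reported in the paper.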

Subjects:	Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (stat.ML)
Cite as:	arXiv:1804.04659 [cs.LG]
	(or arXiv:1804.04659v4 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1804.04659
Related DOI:	https://doi.org/10.1109/IPDPS54959.2023.00034

Submission history

From: Daning Cheng
[v1] Thu, 12 Apr 2018 14:06:05 UTC (862 KB)
[v2] Fri, 18 May 2018 04:26:26 UTC (814 KB)
[v3] Fri, 17 Aug 2018 01:57:44 UTC (813 KB)
[v4] Thu, 18 Jul 2019 06:50:05 UTC (873 KB)
