Weighted Sampling for Combined Model Selection and Hyperparameter Tuning

Sarigiannis, Dimitrios; Parnell, Thomas; Pozidis, Haris

arXiv:1909.07140 (cs)

[Submitted on 16 Sep 2019 (v1), last revised 21 Nov 2019 (this version, v3)]

Abstract: The combined algorithm selection and hyperparameter tuning (CASH) problem is characterized by large hierarchical hyperparameter spaces. Model-free hyperparameter tuning methods can explore such large spaces efficiently since they are highly parallelizable across multiple machines. When no prior knowledge or meta-data exists to boost their performance, these methods commonly sample random configurations following a uniform distribution. In this work, we propose a novel sampling distribution as an alternative to uniform sampling and prove theoretically that it has a better chance of finding the best configuration in a worst-case setting. In order to compare competing methods rigorously in an experimental setting, one must perform statistical hypothesis testing. We show that there is little-to-no agreement in the automated machine learning literature regarding which methods should be used. We contrast this disparity with the methods recommended by the broader statistics literature, and identify a suitable approach. We then select three popular model-free solutions to CASH and evaluate their performance, with uniform sampling as well as the proposed sampling scheme, across 67 datasets from the OpenML platform. We investigate the trade-off between exploration and exploitation across the three algorithms, and verify empirically that the proposed sampling distribution improves performance in all cases.
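
To make the contrast between uniform and non-uniform sampling over a hierarchical CASH space concrete, here is a minimal sketch. The abstract does not state the form of the proposed distribution, so the weighting rule below (model probability proportional to its number of hyperparameters) is an assumption chosen only for illustration. The search space, model names, ranges, and function names are likewise hypothetical.

```python
import random

# Hypothetical CASH search space: model name -> {hyperparameter: (low, high)}.
# Models and ranges are illustrative, not taken from the paper.
SEARCH_SPACE = {
    "logistic_regression": {"C": (1e-3, 1e3)},
    "random_forest": {
        "max_depth": (2, 20),
        "min_samples_leaf": (1, 10),
        "max_features": (0.1, 1.0),
    },
    "gradient_boosting": {
        "learning_rate": (0.01, 0.3),
        "max_depth": (2, 10),
        "subsample": (0.5, 1.0),
        "n_estimators": (50, 500),
    },
}


def model_weights(space, weighted):
    """Return a probability for each model in the search space.

    weighted=False: every model is equally likely (uniform baseline).
    weighted=True:  probability proportional to the number of
                    hyperparameters (an assumed weighting rule).
    """
    if not weighted:
        return {model: 1.0 / len(space) for model in space}
    total = sum(len(hparams) for hparams in space.values())
    return {model: len(hparams) / total for model, hparams in space.items()}


def sample_configuration(space, rng, weighted=False):
    """Draw one (model, hyperparameters) pair from the hierarchical space."""
    weights = model_weights(space, weighted)
    models = list(space)
    # Step 1: pick the model according to the chosen distribution.
    model = rng.choices(models, weights=[weights[m] for m in models], k=1)[0]
    # Step 2: sample that model's hyperparameters uniformly within range.
    params = {
        name: rng.uniform(low, high)
        for name, (low, high) in space[model].items()
    }
    return model, params


if __name__ == "__main__":
    rng = random.Random(0)
    print(model_weights(SEARCH_SPACE, weighted=False))
    print(model_weights(SEARCH_SPACE, weighted=True))
    print(sample_configuration(SEARCH_SPACE, rng, weighted=True))
```

Under this assumed rule, models with larger hyperparameter subspaces receive more of the sampling budget, whereas the uniform baseline spends the same number of draws on a one-parameter model as on a four-parameter one.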

Comments:	Accepted for presentation at The Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI 2020)
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1909.07140 [cs.LG]
	(or arXiv:1909.07140v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1909.07140

Submission history

From: Thomas Parnell
[v1] Mon, 16 Sep 2019 12:01:12 UTC (100 KB)
[v2] Tue, 17 Sep 2019 07:57:49 UTC (101 KB)
[v3] Thu, 21 Nov 2019 12:19:57 UTC (99 KB)
