Optimization as Estimation with Gaussian Processes in Bandit Settings

Wang, Zi; Zhou, Bolei; Jegelka, Stefanie

Statistics > Machine Learning

arXiv:1510.06423 (stat)

[Submitted on 21 Oct 2015 (v1), last revised 12 Aug 2018 (this version, v4)]

Title:Optimization as Estimation with Gaussian Processes in Bandit Settings

Authors:Zi Wang, Bolei Zhou, Stefanie Jegelka

View PDF

Abstract:Recently, there has been rising interest in Bayesian optimization -- the optimization of an unknown function with assumptions usually expressed by a Gaussian Process (GP) prior. We study an optimization strategy that directly uses an estimate of the argmax of the function. This strategy offers both practical and theoretical advantages: no tradeoff parameter needs to be selected, and, moreover, we establish close connections to the popular GP-UCB and GP-PI strategies. Our approach can be understood as automatically and adaptively trading off exploration and exploitation in GP-UCB and GP-PI. We illustrate the effects of this adaptive tuning via bounds on the regret as well as an extensive empirical evaluation on robotics and vision tasks, demonstrating the robustness of this strategy for a range of performance criteria.

Comments:	Proceedings of the 19th International Conference on Artificial Intelligence and Statistics (AISTATS) 2016, Cadiz, Spain
Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:1510.06423 [stat.ML]
	(or arXiv:1510.06423v4 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1510.06423

Submission history

From: Zi Wang [view email]
[v1] Wed, 21 Oct 2015 20:35:13 UTC (4,283 KB)
[v2] Sat, 31 Oct 2015 18:22:28 UTC (4,283 KB)
[v3] Sun, 24 Apr 2016 16:56:38 UTC (4,030 KB)
[v4] Sun, 12 Aug 2018 16:03:00 UTC (5,268 KB)

Statistics > Machine Learning

Title:Optimization as Estimation with Gaussian Processes in Bandit Settings

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Optimization as Estimation with Gaussian Processes in Bandit Settings

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators