Epsilon-Greedy Thompson Sampling to Bayesian Optimization

Do, Bach; Adebiyi, Taiwo; Zhang, Ruda

doi:10.1115/1.4066858

Computer Science > Machine Learning

arXiv:2403.00540 (cs)

[Submitted on 1 Mar 2024 (v1), last revised 30 Oct 2024 (this version, v3)]

Title:Epsilon-Greedy Thompson Sampling to Bayesian Optimization

Authors:Bach Do, Taiwo Adebiyi, Ruda Zhang

View PDF HTML (experimental)

Abstract:Bayesian optimization (BO) has become a powerful tool for solving simulation-based engineering optimization problems thanks to its ability to integrate physical and mathematical understandings, consider uncertainty, and address the exploitation-exploration dilemma. Thompson sampling (TS) is a preferred solution for BO to handle the exploitation-exploration trade-off. While it prioritizes exploration by generating and minimizing random sample paths from probabilistic models -- a fundamental ingredient of BO -- TS weakly manages exploitation by gathering information about the true objective function after it obtains new observations. In this work, we improve the exploitation of TS by incorporating the $\varepsilon$-greedy policy, a well-established selection strategy in reinforcement learning. We first delineate two extremes of TS, namely the generic TS and the sample-average TS. The former promotes exploration, while the latter favors exploitation. We then adopt the $\varepsilon$-greedy policy to randomly switch between these two extremes. Small and large values of $\varepsilon$ govern exploitation and exploration, respectively. By minimizing two benchmark functions and solving an inverse problem of a steel cantilever beam, we empirically show that $\varepsilon$-greedy TS equipped with an appropriate $\varepsilon$ is more robust than its two extremes, matching or outperforming the better of the generic TS and the sample-average TS.

Subjects:	Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
Cite as:	arXiv:2403.00540 [cs.LG]
	(or arXiv:2403.00540v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2403.00540
Related DOI:	https://doi.org/10.1115/1.4066858

Submission history

From: Ruda Zhang [view email]
[v1] Fri, 1 Mar 2024 13:53:44 UTC (3,045 KB)
[v2] Sat, 4 May 2024 14:04:48 UTC (1,081 KB)
[v3] Wed, 30 Oct 2024 20:22:36 UTC (1,138 KB)

Computer Science > Machine Learning

Title:Epsilon-Greedy Thompson Sampling to Bayesian Optimization

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Epsilon-Greedy Thompson Sampling to Bayesian Optimization

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators