Optimal Best-Arm Identification in Bandits with Access to Offline Data

Agrawal, Shubhada; Juneja, Sandeep; Shanmugam, Karthikeyan; Suggala, Arun Sai

arXiv:2306.09048 (cs)

[Submitted on 15 Jun 2023]

Abstract: Learning paradigms based purely on offline data as well as those based solely on sequential online learning have been well studied in the literature. In this paper, we consider combining offline data with online learning, an area less studied but of obvious practical importance. We consider the stochastic $K$-armed bandit problem, where our goal is to identify the arm with the highest mean in the presence of relevant offline data, with confidence $1-\delta$. We conduct a lower bound analysis on policies that provide such $1-\delta$ probabilistic correctness guarantees. We develop algorithms that match the lower bound on sample complexity when $\delta$ is small. Our algorithms are computationally efficient with an average per-sample acquisition cost of $\tilde{O}(K)$, and rely on a careful characterization of the optimality conditions of the lower bound problem.
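To make the problem setting concrete, the following is a minimal sketch of fixed-confidence best-arm identification that is warm-started with offline samples. It is not the algorithm developed in the paper and it does not attain the lower bound; it is a generic successive-elimination baseline with Hoeffding confidence intervals. It assumes rewards in $[0, 1]$ and that the offline samples for each arm come from the same distribution as that arm's online rewards. All names (`identify_best_arm`, `confidence_radius`, `pull`, `offline_data`) are hypothetical and chosen for illustration only.

```python
import math
import random


def confidence_radius(n, num_arms, delta):
    """Hoeffding radius for rewards in [0, 1], valid uniformly over counts n.

    The n^2 term pays for a union bound over all sample counts and arms.
    """
    if n == 0:
        return math.inf
    return math.sqrt(math.log(4 * num_arms * n * n / delta) / (2 * n))


def identify_best_arm(pull, offline_data, delta, max_online_pulls=100_000):
    """Successive elimination warm-started with offline samples (illustrative).

    pull(i) returns one online reward in [0, 1] from arm i.
    offline_data[i] is a list of logged rewards for arm i; ASSUMPTION: they
    are drawn from the same distribution as the online rewards of arm i.
    Returns (arm, online_pulls, certified).
    """
    num_arms = len(offline_data)
    # Offline samples seed the running statistics at no online cost.
    counts = [len(samples) for samples in offline_data]
    sums = [float(sum(samples)) for samples in offline_data]
    active = set(range(num_arms))
    online_pulls = 0

    while len(active) > 1 and online_pulls + len(active) <= max_online_pulls:
        # Pull every surviving arm once.
        for i in active:
            sums[i] += pull(i)
            counts[i] += 1
            online_pulls += 1
        means = {i: sums[i] / counts[i] for i in active}
        radii = {i: confidence_radius(counts[i], num_arms, delta) for i in active}
        # Drop arms whose upper bound falls below the best lower bound.
        best_lower = max(means[i] - radii[i] for i in active)
        active = {i for i in active if means[i] + radii[i] >= best_lower}

    if len(active) == 1:
        return next(iter(active)), online_pulls, True
    # Budget exhausted: return the empirical leader without a guarantee.
    best = max(active, key=lambda i: sums[i] / counts[i])
    return best, online_pulls, False


if __name__ == "__main__":
    rng = random.Random(0)
    true_means = [0.2, 0.5, 0.8]  # hypothetical Bernoulli arms

    def pull(i):
        return 1.0 if rng.random() < true_means[i] else 0.0

    offline = [[pull(i) for _ in range(2000)] for i in range(len(true_means))]
    print(identify_best_arm(pull, offline, delta=0.05))
    print(identify_best_arm(pull, [[] for _ in true_means], delta=0.05))
```

In this sketch, plentiful offline data shrinks the confidence intervals before any online sample is drawn, so the warm-started run typically stops after far fewer online pulls than the run with empty offline data. The paper's contribution concerns how to do this optimally, which this baseline does not attempt.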

Comments:	45 pages, 5 figures
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2306.09048 [cs.LG]
	(or arXiv:2306.09048v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2306.09048

Submission history

From: Shubhada Agrawal
[v1] Thu, 15 Jun 2023 11:12:35 UTC (297 KB)
