Optimal UCB Adjustments for Large Arm Sizes

Chan, Hock Peng; Hu, Shouri

arXiv:1909.02229 (math)

[Submitted on 5 Sep 2019]

Abstract: The regret lower bound of Lai and Robbins (1985), the gold standard for checking optimality of bandit algorithms, considers arm size fixed as sample size goes to infinity. We show that when arm size increases polynomially with sample size, a surprisingly smaller lower bound is achievable. This is because the larger experimentation costs when there are more arms permit regret savings by exploiting the best performer more often. In particular, we construct a UCB-Large algorithm that adaptively exploits more when there are more arms. It achieves the smaller lower bound and is thus optimal. Numerical experiments show that UCB-Large performs better than classical UCB, which does not correct for arm size, and better than Thompson sampling.
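
For orientation, the classical fixed-arm bound that the abstract refers to can be written roughly as follows. This is a sketch in assumed notation (K, f_k, \mu_k, R_n are illustrative symbols, not taken from the abstract), and it shows only the standard Lai and Robbins statement for uniformly good policies, not the smaller bound derived in the paper.

```latex
% Assumed notation (not from the abstract):
%   K fixed arms; arm k has reward density f_k with mean \mu_k;
%   \mu^* = \max_k \mu_k is attained by an arm with density f^*;
%   R_n is the expected regret after n pulls.
\liminf_{n \to \infty} \frac{R_n}{\log n}
  \;\ge\; \sum_{k \,:\, \mu_k < \mu^*}
  \frac{\mu^* - \mu_k}{\mathrm{KL}(f_k \,\|\, f^*)}
```

The bound holds with K held fixed as n grows; the abstract's claim concerns the regime where the number of arms instead grows polynomially with n.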

Comments:	First Draft
Subjects:	Statistics Theory (math.ST)
Cite as:	arXiv:1909.02229 [math.ST]
	(or arXiv:1909.02229v1 [math.ST] for this version)
	https://doi.org/10.48550/arXiv.1909.02229

Submission history

From: Shouri Hu
[v1] Thu, 5 Sep 2019 06:30:17 UTC (19 KB)
