Asymptotic properties of a multicolored random reinforced urn model with an application to multi-armed bandits

Yang, Li; Hu, Jiang; Li, Jianghao; Bai, Zhidong

Mathematics > Statistics Theory

arXiv:2406.10854 (math)

[Submitted on 16 Jun 2024]

Title:Asymptotic properties of a multicolored random reinforced urn model with an application to multi-armed bandits

Authors:Li Yang, Jiang Hu, Jianghao Li, Zhidong Bai

View PDF HTML (experimental)

Abstract:The random self-reinforcement mechanism, characterized by the principle of ``the rich get richer'', has demonstrated significant utility across various domains. One prominent model embodying this mechanism is the random reinforcement urn model. This paper investigates a multicolored, multiple-drawing variant of the random reinforced urn model. We establish the limiting behavior of the normalized urn composition and demonstrate strong convergence upon scaling the counts of each color. Additionally, we derive strong convergence estimators for the reinforcement means, i.e., for the expectations of the replacement matrix's diagonal elements, and prove their joint asymptotic normality. It is noteworthy that the estimators of the largest reinforcement mean are asymptotically independent of the estimators of the other smaller reinforcement means. Additionally, if a reinforcement mean is not the largest, the estimators of these smaller reinforcement means will also demonstrate asymptotic independence among themselves. Furthermore, we explore the parallels between the reinforced mechanisms in random reinforced urn models and multi-armed bandits, addressing hypothesis testing for expected payoffs in the latter context.

Subjects:	Statistics Theory (math.ST)
Cite as:	arXiv:2406.10854 [math.ST]
	(or arXiv:2406.10854v1 [math.ST] for this version)
	https://doi.org/10.48550/arXiv.2406.10854

Submission history

From: Li Yang [view email]
[v1] Sun, 16 Jun 2024 09:01:28 UTC (140 KB)

Mathematics > Statistics Theory

Title:Asymptotic properties of a multicolored random reinforced urn model with an application to multi-armed bandits

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Mathematics > Statistics Theory

Title:Asymptotic properties of a multicolored random reinforced urn model with an application to multi-armed bandits

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators