Online Learning of Rested and Restless Bandits

Tekin, Cem; Liu, Mingyan

doi:10.1109/TIT.2012.2198613

Mathematics > Optimization and Control

arXiv:1102.3508 (math)

[Submitted on 17 Feb 2011]

Title:Online Learning of Rested and Restless Bandits

Authors:Cem Tekin, Mingyan Liu

View PDF

Abstract:In this paper we study the online learning problem involving rested and restless multiarmed bandits with multiple plays. The system consists of a single player/user and a set of K finite-state discrete-time Markov chains (arms) with unknown state spaces and statistics. At each time step the player can play M arms. The objective of the user is to decide for each step which M of the K arms to play over a sequence of trials so as to maximize its long term reward. The restless multiarmed bandit is particularly relevant to the application of opportunistic spectrum access (OSA), where a (secondary) user has access to a set of K channels, each of time-varying condition as a result of random fading and/or certain primary users' activities.

Subjects:	Optimization and Control (math.OC); Machine Learning (cs.LG)
Cite as:	arXiv:1102.3508 [math.OC]
	(or arXiv:1102.3508v1 [math.OC] for this version)
	https://doi.org/10.48550/arXiv.1102.3508
Journal reference:	Information Theory, IEEE Transactions on , vol.58, no.8, pp.5588,5611, Aug. 2012
Related DOI:	https://doi.org/10.1109/TIT.2012.2198613

Submission history

From: Cem Tekin [view email]
[v1] Thu, 17 Feb 2011 07:08:37 UTC (173 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2011-02

Change to browse by:

cs
math
math.OC

References & Citations

export BibTeX citation

Mathematics > Optimization and Control

Title:Online Learning of Rested and Restless Bandits

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Mathematics > Optimization and Control

Title:Online Learning of Rested and Restless Bandits

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators