Exponential Convergence Rate for the Asymptotic Optimality of Whittle Index Policy

Gast, Nicolas; Gaujal, Bruno; Yan, Chen

arXiv:2012.09064 (cs)

[Submitted on 16 Dec 2020]

Authors: Nicolas Gast (POLARIS), Bruno Gaujal (POLARIS), Chen Yan (POLARIS)

Abstract: We evaluate the performance of the Whittle index policy for restless Markovian bandits as the number of bandits grows. It is proven in [30] that this performance is asymptotically optimal if the bandits are indexable and the associated deterministic system has a fixed point that is a global attractor. In this paper we show that, under the same conditions, the convergence rate is exponential in the number of bandits, unless the fixed point is singular (to be defined later). Our proof relies on the nature of the deterministic equation governing the stochastic system: we show that it is a piecewise affine continuous dynamical system inside the simplex of the empirical measure of the bandits. Using simulations and numerical solvers, we also investigate cases where the conditions of the exponential rate theorem are violated, notably when attracting limit cycles appear or when the fixed point is singular. We illustrate our theorem on a Markovian fading channel model that has been well studied in the literature. Finally, we extend our results for the synchronous model to the asynchronous model.
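
The piecewise affine structure mentioned in the abstract can be illustrated with a minimal sketch. It is not taken from the paper; it assumes a synchronous discrete-time model in which a fraction alpha of the bandits is activated at each step, and it assumes the priority order of the states (highest Whittle index first) is already known. The function name whittle_mean_field_step, the matrices P0 and P1, and the numbers in the usage lines are hypothetical and chosen only for illustration.

```python
import numpy as np

def whittle_mean_field_step(m, P0, P1, alpha, priority):
    """One step of an assumed deterministic map m -> phi(m).

    m        : empirical measure over d states (nonnegative, sums to 1)
    P0, P1   : d x d row-stochastic matrices (passive / active action)
    alpha    : fraction of bandits activated at each step
    priority : state indices ordered from highest to lowest index
               (assumed given; computing the indices is not shown)
    """
    m = np.asarray(m, dtype=float)
    active = np.zeros_like(m)
    budget = alpha
    # Fill the activation budget greedily, highest-priority state first.
    for s in priority:
        take = min(m[s], budget)
        active[s] = take
        budget -= take
        if budget <= 0:
            break
    passive = m - active
    # Active mass moves with P1, passive mass with P0.
    return active @ P1 + passive @ P0

# Hypothetical two-state example.
P0 = np.array([[0.9, 0.1], [0.2, 0.8]])
P1 = np.array([[0.5, 0.5], [0.6, 0.4]])
next_m = whittle_mean_field_step([0.3, 0.7], P0, P1, alpha=0.5, priority=[0, 1])
```

In this sketch the map is affine on each region of the simplex where the same state absorbs the last part of the activation budget, and the regions join continuously, which is one way to read the "piecewise affine continuous" description above.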

Subjects: Performance (cs.PF); Optimization and Control (math.OC); Probability (math.PR)
Cite as: arXiv:2012.09064 [cs.PF] (or arXiv:2012.09064v1 [cs.PF] for this version)
DOI: https://doi.org/10.48550/arXiv.2012.09064

Submission history

From: Nicolas Gast (via CCSD proxy)
[v1] Wed, 16 Dec 2020 16:34:39 UTC (437 KB)
