An Accelerated Multi-level Monte Carlo Approach for Average Reward Reinforcement Learning with General Policy Parametrization

Ganesh, Swetha; Aggarwal, Vaneet

Computer Science > Machine Learning

arXiv:2407.18878 (cs)

[Submitted on 26 Jul 2024]

Title:An Accelerated Multi-level Monte Carlo Approach for Average Reward Reinforcement Learning with General Policy Parametrization

Authors:Swetha Ganesh, Vaneet Aggarwal

View PDF HTML (experimental)

Abstract:In our study, we delve into average-reward reinforcement learning with general policy parametrization. Within this domain, current guarantees either fall short with suboptimal guarantees or demand prior knowledge of mixing time. To address these issues, we introduce Randomized Accelerated Natural Actor Critic, a method that integrates Multi-level Monte-Carlo and Natural Actor Critic. Our approach is the first to achieve global convergence rate of $\tilde{\mathcal{O}}(1/\sqrt{T})$ without requiring knowledge of mixing time, significantly surpassing the state-of-the-art bound of $\tilde{\mathcal{O}}(1/T^{1/4})$.

Comments:	28 pages, 1 table
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2407.18878 [cs.LG]
	(or arXiv:2407.18878v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2407.18878

Submission history

From: Swetha Ganesh [view email]
[v1] Fri, 26 Jul 2024 17:16:31 UTC (39 KB)

Full-text links:

Access Paper:

view license

Current browse context:

< prev | next >

new | recent | 2024-07

Change to browse by:

cs.LG

References & Citations

export BibTeX citation

Computer Science > Machine Learning

Title:An Accelerated Multi-level Monte Carlo Approach for Average Reward Reinforcement Learning with General Policy Parametrization

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:An Accelerated Multi-level Monte Carlo Approach for Average Reward Reinforcement Learning with General Policy Parametrization

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators