Where2Start: Leveraging initial States for Robust and Sample-Efficient Reinforcement Learning

Parsa, Pouya; Moayedi, Raoof Zare; Bornosi, Mohammad; Bejani, Mohammad Mahdi

Computer Science > Machine Learning

arXiv:2311.15089 (cs)

[Submitted on 25 Nov 2023]

Title:Where2Start: Leveraging initial States for Robust and Sample-Efficient Reinforcement Learning

Authors:Pouya Parsa, Raoof Zare Moayedi, Mohammad Bornosi, Mohammad Mahdi Bejani

View PDF

Abstract:The reinforcement learning algorithms that focus on how to compute the gradient and choose next actions, are effectively improved the performance of the agents. However, these algorithms are environment-agnostic. This means that the algorithms did not use the knowledge that has been captured by trajectory. This poses that the algorithms should sample many trajectories to train the model. By considering the essence of environment and how much the agent learn from each scenario in that environment, the strategy of the learning procedure can be changed. The strategy retrieves more informative trajectories, so the agent can learn with fewer trajectory sample. We propose Where2Start algorithm that selects the initial state so that the agent has more instability in vicinity of that state. We show that this kind of selection decreases number of trajectories that should be sampled that the agent reach to acceptable reward. Our experiments shows that Where2Start can improve sample efficiency up to 8 times. Also Where2Start can combined with most of state-of-the-art algorithms and improve that robustness and sample efficiency significantly.

Comments:	9 pages, 3 figures
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2311.15089 [cs.LG]
	(or arXiv:2311.15089v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2311.15089

Submission history

From: Mohammad Mahdi Bejani [view email]
[v1] Sat, 25 Nov 2023 18:00:26 UTC (104 KB)

Computer Science > Machine Learning

Title:Where2Start: Leveraging initial States for Robust and Sample-Efficient Reinforcement Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Where2Start: Leveraging initial States for Robust and Sample-Efficient Reinforcement Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators