When Simple Exploration is Sample Efficient: Identifying Sufficient Conditions for Random Exploration to Yield PAC RL Algorithms

Liu, Yao; Brunskill, Emma

arXiv:1805.09045 (cs)

[Submitted on 23 May 2018 (v1), last revised 17 Apr 2019 (this version, v4)]

Abstract: Efficient exploration is one of the key challenges for reinforcement learning (RL) algorithms. Most traditional sample efficiency bounds require strategic exploration. Recently, many deep RL algorithms that use simple heuristic exploration strategies with few formal guarantees have achieved surprising success in many domains. These results raise an important question about understanding such exploration strategies, for example $\epsilon$-greedy, as well as understanding what characterizes the difficulty of exploration in MDPs. In this work we propose problem-specific sample complexity bounds for $Q$-learning with random walk exploration that rely on several structural properties. We also link our theoretical results to some empirical benchmark domains, to illustrate whether our bound gives polynomial sample complexity in these domains and how that relates to the empirical performance.

Comments:	Appeared in The 14th European Workshop on Reinforcement Learning (EWRL), 2018
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:1805.09045 [cs.LG]
	(or arXiv:1805.09045v4 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1805.09045

Submission history

From: Yao Liu
[v1] Wed, 23 May 2018 10:43:56 UTC (98 KB)
[v2] Wed, 13 Jun 2018 07:11:34 UTC (98 KB)
[v3] Sat, 4 Aug 2018 01:13:03 UTC (94 KB)
[v4] Wed, 17 Apr 2019 19:58:38 UTC (94 KB)
