Repeated Inverse Reinforcement Learning

Amin, Kareem; Jiang, Nan; Singh, Satinder


arXiv:1705.05427 (cs)

[Submitted on 15 May 2017 (v1), last revised 4 Nov 2017 (this version, v3)]


Abstract: We introduce a novel repeated Inverse Reinforcement Learning problem: the agent has to act on behalf of a human in a sequence of tasks and wishes to minimize the number of tasks in which it surprises the human by acting suboptimally with respect to how the human would have acted. Each time the human is surprised, the agent is provided a demonstration of the desired behavior by the human. We formalize this problem, including how the sequence of tasks is chosen, in a few different ways and provide some foundational results.

Comments: The first two authors contributed equally to this work. The paper appears in NIPS 2017
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as: arXiv:1705.05427 [cs.AI] (or arXiv:1705.05427v3 [cs.AI] for this version)
DOI: https://doi.org/10.48550/arXiv.1705.05427

Submission history

From: Nan Jiang
[v1] Mon, 15 May 2017 20:06:35 UTC (59 KB)
[v2] Thu, 18 May 2017 19:32:27 UTC (59 KB)
[v3] Sat, 4 Nov 2017 00:38:19 UTC (30 KB)
