Specification Inference from Demonstrations

Vazquez-Chanlatte, Marcell; Jha, Susmit; Tiwari, Ashish; Seshia, Sanjit A.

Computer Science > Machine Learning

arXiv:1710.03875v1 (cs)

[Submitted on 11 Oct 2017 (this version), latest version 27 Oct 2018 (v5)]

Title:Specification Inference from Demonstrations

Authors:Marcell Vazquez-Chanlatte, Susmit Jha, Ashish Tiwari, Sanjit A. Seshia

View PDF

Abstract:Learning from expert demonstrations has received a lot of attention in artificial intelligence and machine learning. The goal is to infer the underlying reward function that an agent is optimizing given a set of observations of the agent's behavior over time in a variety of circumstances, the system state trajectories, and a plant model specifying the evolution of the system state for different agent's actions. The system is often modeled as a Markov decision process, that is, the next state depends only on the current state and agent's action, and the the agent's choice of action depends only on the current state. While the former is a Markovian assumption on the evolution of system state, the later assumes that the target reward function is itself Markovian. In this work, we explore learning a class of non-Markovian reward functions, known in the formal methods literature as specifications. These specifications offer better composition, transferability, and interpretability. We then show that inferring the specification can be done efficiently without unrolling the transition system. We demonstrate on a 2-d grid world example.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Logic in Computer Science (cs.LO)
Cite as:	arXiv:1710.03875 [cs.LG]
	(or arXiv:1710.03875v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1710.03875

Submission history

From: Marcell Vazquez-Chanlatte [view email]
[v1] Wed, 11 Oct 2017 01:31:14 UTC (235 KB)
[v2] Wed, 14 Feb 2018 06:03:22 UTC (973 KB)
[v3] Mon, 13 Aug 2018 00:32:09 UTC (1,788 KB)
[v4] Tue, 14 Aug 2018 03:32:12 UTC (1,788 KB)
[v5] Sat, 27 Oct 2018 16:49:13 UTC (2,614 KB)

Computer Science > Machine Learning

Title:Specification Inference from Demonstrations

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Specification Inference from Demonstrations

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators