Inverse POMDP: Inferring What You Think from What You Do

Wu, Zhengwei; Schrater, Paul; Pitkow, Xaq

Computer Science > Machine Learning

arXiv:1805.09864v1 (cs)

[Submitted on 24 May 2018 (this version), latest version 11 Jun 2019 (v4)]

Title:Inverse POMDP: Inferring What You Think from What You Do

Authors:Zhengwei Wu, Paul Schrater, Xaq Pitkow

View PDF

Abstract:Complex behaviors are often driven by an internal model, which integrates sensory information over time and facilitates long-term planning. Inferring the internal model is a crucial ingredient for interpreting neural activities of agents and is beneficial for imitation learning. Here we describe a method to infer an agent's internal model and dynamic beliefs, and apply it to a simulated agent performing a foraging task. We assume the agent behaves rationally according to their understanding of the task and the relevant causal variables that cannot be fully observed. We model this rational solution as a Partially Observable Markov Decision Process (POMDP). However, we allow that the agent may have wrong assumptions about the task, and our method learns these assumptions from the agent's this http URL the agent's sensory observations and actions, we learn its internal model by maximum likelihood estimation over a set of task-relevant parameters. The Markov property of the POMDP enables us to characterize the transition probabilities between internal states and iteratively estimate the agent's policy using a constrained Expectation-Maximization algorithm. We validate our method on simulated agents performing suboptimally on a foraging task, and successfully recover the agent's actual model.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1805.09864 [cs.LG]
	(or arXiv:1805.09864v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1805.09864

Submission history

From: Zhengwei Wu [view email]
[v1] Thu, 24 May 2018 19:36:12 UTC (560 KB)
[v2] Tue, 18 Sep 2018 15:18:53 UTC (560 KB)
[v3] Sun, 7 Oct 2018 04:41:28 UTC (565 KB)
[v4] Tue, 11 Jun 2019 19:59:06 UTC (3,216 KB)

Computer Science > Machine Learning

Title:Inverse POMDP: Inferring What You Think from What You Do

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Inverse POMDP: Inferring What You Think from What You Do

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators