Subgoal Discovery Using a Free Energy Paradigm and State Aggregations

Mesbah, Amirhossein; Hosseini, Reshad; Shariatpanahi, Seyed Pooya; Ahmadabadi, Majid Nili

Computer Science > Machine Learning

arXiv:2412.16687 (cs)

[Submitted on 21 Dec 2024]

Title:Subgoal Discovery Using a Free Energy Paradigm and State Aggregations

Authors:Amirhossein Mesbah, Reshad Hosseini, Seyed Pooya Shariatpanahi, Majid Nili Ahmadabadi

View PDF HTML (experimental)

Abstract:Reinforcement learning (RL) plays a major role in solving complex sequential decision-making tasks. Hierarchical and goal-conditioned RL are promising methods for dealing with two major problems in RL, namely sample inefficiency and difficulties in reward shaping. These methods tackle the mentioned problems by decomposing a task into simpler subtasks and temporally abstracting a task in the action space. One of the key components for task decomposition of these methods is subgoal discovery. We can use the subgoal states to define hierarchies of actions and also use them in decomposing complex tasks. Under the assumption that subgoal states are more unpredictable, we propose a free energy paradigm to discover them. This is achieved by using free energy to select between two spaces, the main space and an aggregation space. The $model \; changes$ from neighboring states to a given state shows the unpredictability of a given state, and therefore it is used in this paper for subgoal discovery. Our empirical results on navigation tasks like grid-world environments show that our proposed method can be applied for subgoal discovery without prior knowledge of the task. Our proposed method is also robust to the stochasticity of environments.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2412.16687 [cs.LG]
	(or arXiv:2412.16687v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2412.16687

Submission history

From: Amirhossein Mesbah [view email]
[v1] Sat, 21 Dec 2024 16:26:47 UTC (1,753 KB)

Computer Science > Machine Learning

Title:Subgoal Discovery Using a Free Energy Paradigm and State Aggregations

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Subgoal Discovery Using a Free Energy Paradigm and State Aggregations

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators