Sample-Efficient Curriculum Reinforcement Learning for Complex Reward Functions

Freitag, Kilian; Ceder, Kristian; Laezza, Rita; Åkesson, Knut; Chehreghani, Morteza Haghir

Computer Science > Machine Learning

arXiv:2410.16790v1 (cs)

[Submitted on 22 Oct 2024 (this version), latest version 10 Feb 2025 (v2)]

Title:Sample-Efficient Curriculum Reinforcement Learning for Complex Reward Functions

Authors:Kilian Freitag, Kristian Ceder, Rita Laezza, Knut Åkesson, Morteza Haghir Chehreghani

View PDF HTML (experimental)

Abstract:Reinforcement learning (RL) shows promise in control problems, but its practical application is often hindered by the complexity arising from intricate reward functions with constraints. While the reward hypothesis suggests these competing demands can be encapsulated in a single scalar reward function, designing such functions remains challenging. Building on existing work, we start by formulating preferences over trajectories to derive a realistic reward function that balances goal achievement with constraint satisfaction in the application of mobile robotics with dynamic obstacles. To mitigate reward exploitation in such complex settings, we propose a novel two-stage reward curriculum combined with a flexible replay buffer that adaptively samples experiences. Our approach first learns on a subset of rewards before transitioning to the full reward, allowing the agent to learn trade-offs between objectives and constraints. After transitioning to a new stage, our method continues to make use of past experiences by updating their rewards for sample-efficient learning. We investigate the efficacy of our approach in robot navigation tasks and demonstrate superior performance compared to baselines in terms of true reward achievement and task completion, underlining its effectiveness.

Subjects:	Machine Learning (cs.LG); Robotics (cs.RO)
Cite as:	arXiv:2410.16790 [cs.LG]
	(or arXiv:2410.16790v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2410.16790

Submission history

From: Kilian Tamino Freitag [view email]
[v1] Tue, 22 Oct 2024 08:07:44 UTC (6,766 KB)
[v2] Mon, 10 Feb 2025 10:42:49 UTC (3,436 KB)

Computer Science > Machine Learning

Title:Sample-Efficient Curriculum Reinforcement Learning for Complex Reward Functions

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Sample-Efficient Curriculum Reinforcement Learning for Complex Reward Functions

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators