HASARD: A Benchmark for Vision-Based Safe Reinforcement Learning in Embodied Agents

Tomilin, Tristan; Fang, Meng; Pechenizkiy, Mykola

arXiv:2503.08241 (cs)

[Submitted on 11 Mar 2025]

Abstract: Advancing safe autonomous systems through reinforcement learning (RL) requires robust benchmarks to evaluate performance, analyze methods, and assess agent competencies. Humans primarily rely on embodied visual perception to safely navigate and interact with their surroundings, making it a valuable capability for RL agents. However, existing vision-based 3D benchmarks only consider simple navigation tasks. To address this shortcoming, we introduce HASARD, a suite of diverse and complex tasks to HArness SAfe RL with Doom, requiring strategic decision-making, comprehending spatial relationships, and predicting the short-term future. HASARD features three difficulty levels and two action spaces. An empirical evaluation of popular baseline methods demonstrates the benchmark's complexity, unique challenges, and reward-cost trade-offs. Visualizing agent navigation during training with top-down heatmaps provides insight into a method's learning process. Incrementally training across difficulty levels offers an implicit learning curriculum. HASARD is the first safe RL benchmark to exclusively target egocentric vision-based learning, offering a cost-effective and insightful way to explore the potential and boundaries of current and future safe RL methods. The environments and baseline implementations are open-sourced at this https URL.

Comments:	Accepted to ICLR 2025
Subjects:	Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
Cite as:	arXiv:2503.08241 [cs.AI]
	(or arXiv:2503.08241v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2503.08241

Submission history

From: Tristan Tomilin
[v1] Tue, 11 Mar 2025 10:05:01 UTC (32,675 KB)
