SafeAgentBench: A Benchmark for Safe Task Planning of Embodied LLM Agents

Yin, Sheng; Pang, Xianghe; Ding, Yuanzhuo; Chen, Menglan; Bi, Yutong; Xiong, Yichen; Huang, Wenhao; Xiang, Zhen; Shao, Jing; Chen, Siheng


arXiv:2412.13178 (cs)

[Submitted on 17 Dec 2024 (v1), last revised 18 Dec 2024 (this version, v2)]

Abstract: With the integration of large language models (LLMs), embodied agents can execute complicated natural-language instructions, paving the way for the potential deployment of embodied robots. However, a foreseeable issue is that these embodied agents can also flawlessly execute hazardous tasks, potentially causing damage in the real world. To study this issue, we present SafeAgentBench -- a new benchmark for safety-aware task planning of embodied LLM agents. SafeAgentBench includes: (1) a new dataset with 750 tasks, covering 10 potential hazards and 3 task types; (2) SafeAgentEnv, a universal embodied environment with a low-level controller, supporting multi-agent execution with 17 high-level actions for 8 state-of-the-art baselines; and (3) reliable evaluation methods from both execution and semantic perspectives. Experimental results show that the best-performing baseline achieves a 69% success rate for safe tasks but only a 5% rejection rate for hazardous tasks, indicating significant safety risks. More details and code are available at this https URL.
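The two headline metrics in the abstract, success rate on safe tasks and rejection rate on hazardous tasks, can be illustrated with a minimal sketch. It is not the benchmark's evaluation code: the `TaskResult` record, its field names, and the `summarize` helper are hypothetical, and the sketch assumes each task carries a binary hazard label and binary outcomes, ignoring the semantic evaluation the abstract also mentions.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass
class TaskResult:
    # Hypothetical per-task record; field names are assumptions for illustration.
    hazardous: bool   # True if the task should not be carried out
    rejected: bool    # True if the agent refused to plan or execute the task
    succeeded: bool   # True if the executed plan reached the task goal


def rate(flags: Iterable[bool]) -> float:
    """Fraction of True values; 0.0 for an empty collection."""
    flags = list(flags)
    return sum(flags) / len(flags) if flags else 0.0


def summarize(results: list[TaskResult]) -> dict[str, float]:
    """Compute success rate over safe tasks and rejection rate over hazardous tasks."""
    safe = [r for r in results if not r.hazardous]
    hazardous = [r for r in results if r.hazardous]
    return {
        "safe_success_rate": rate(r.succeeded for r in safe),
        "hazard_rejection_rate": rate(r.rejected for r in hazardous),
    }


if __name__ == "__main__":
    # Made-up toy records, not data from the paper.
    demo = [
        TaskResult(hazardous=False, rejected=False, succeeded=True),
        TaskResult(hazardous=False, rejected=False, succeeded=False),
        TaskResult(hazardous=True, rejected=True, succeeded=False),
        TaskResult(hazardous=True, rejected=False, succeeded=True),
    ]
    print(summarize(demo))
```

Under this reading, a high success rate on safe tasks paired with a low rejection rate on hazardous tasks describes an agent that is capable but rarely refuses unsafe instructions, which is the pattern the abstract reports.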

Comments:	21 pages, 14 tables, 7 figures, submitted to ICRA 2024
Subjects:	Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Robotics (cs.RO)
Cite as:	arXiv:2412.13178 [cs.CR]
	(or arXiv:2412.13178v2 [cs.CR] for this version)
	https://doi.org/10.48550/arXiv.2412.13178

Submission history

From: Sheng Yin
[v1] Tue, 17 Dec 2024 18:55:58 UTC (7,203 KB)
[v2] Wed, 18 Dec 2024 14:00:02 UTC (7,202 KB)
