Policy Teaching in Reinforcement Learning via Environment Poisoning Attacks

Rakhsha, Amin; Radanovic, Goran; Devidze, Rati; Zhu, Xiaojin; Singla, Adish

Computer Science > Machine Learning

arXiv:2011.10824 (cs)

[Submitted on 21 Nov 2020]

Title:Policy Teaching in Reinforcement Learning via Environment Poisoning Attacks

Authors:Amin Rakhsha, Goran Radanovic, Rati Devidze, Xiaojin Zhu, Adish Singla

View PDF

Abstract:We study a security threat to reinforcement learning where an attacker poisons the learning environment to force the agent into executing a target policy chosen by the attacker. As a victim, we consider RL agents whose objective is to find a policy that maximizes reward in infinite-horizon problem settings. The attacker can manipulate the rewards and the transition dynamics in the learning environment at training-time, and is interested in doing so in a stealthy manner. We propose an optimization framework for finding an optimal stealthy attack for different measures of attack cost. We provide lower/upper bounds on the attack cost, and instantiate our attacks in two settings: (i) an offline setting where the agent is doing planning in the poisoned environment, and (ii) an online setting where the agent is learning a policy with poisoned feedback. Our results show that the attacker can easily succeed in teaching any target policy to the victim under mild conditions and highlight a significant security threat to reinforcement learning agents in practice.

Comments:	Journal version of ICML'20 paper. New theoretical results for jointly poisoning rewards and transitions
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
Cite as:	arXiv:2011.10824 [cs.LG]
	(or arXiv:2011.10824v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2011.10824

Submission history

From: Adish Singla [view email]
[v1] Sat, 21 Nov 2020 16:54:45 UTC (3,469 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CR

< prev | next >

new | recent | 2020-11

Change to browse by:

cs
cs.AI
cs.LG

References & Citations

DBLP - CS Bibliography

listing | bibtex

Goran Radanovic
Rati Devidze
Xiaojin Zhu
Adish Singla

export BibTeX citation

Computer Science > Machine Learning

Title:Policy Teaching in Reinforcement Learning via Environment Poisoning Attacks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Policy Teaching in Reinforcement Learning via Environment Poisoning Attacks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators