Greedy Attack and Gumbel Attack: Generating Adversarial Examples for Discrete Data

Yang, Puyudi; Chen, Jianbo; Hsieh, Cho-Jui; Wang, Jane-Ling; Jordan, Michael I.


arXiv:1805.12316 (cs)

[Submitted on 31 May 2018]


Abstract: We present a probabilistic framework for studying adversarial attacks on discrete data. Based on this framework, we derive a perturbation-based method, Greedy Attack, and a scalable learning-based method, Gumbel Attack, that illustrate various tradeoffs in the design of attacks. We demonstrate the effectiveness of these methods using both quantitative metrics and human evaluation on various state-of-the-art models for text classification, including a word-based CNN, a character-based CNN and an LSTM. As an example of our results, we show that the accuracy of character-based convolutional networks drops to the level of random selection by modifying only five characters through Greedy Attack.

Comments:	The first two authors contributed equally
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Cryptography and Security (cs.CR); Machine Learning (stat.ML)
Cite as:	arXiv:1805.12316 [cs.LG]
	(or arXiv:1805.12316v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1805.12316

Submission history

From: Puyudi Yang
[v1] Thu, 31 May 2018 04:40:32 UTC (245 KB)
