Adversarial Training Should Be Cast as a Non-Zero-Sum Game

Robey, Alexander; Latorre, Fabian; Pappas, George J.; Hassani, Hamed; Cevher, Volkan

Computer Science > Machine Learning

arXiv:2306.11035v1 (cs)

[Submitted on 19 Jun 2023 (this version), latest version 18 Mar 2024 (v2)]

Title:Adversarial Training Should Be Cast as a Non-Zero-Sum Game

Authors:Alexander Robey, Fabian Latorre, George J. Pappas, Hamed Hassani, Volkan Cevher

View PDF

Abstract:One prominent approach toward resolving the adversarial vulnerability of deep neural networks is the two-player zero-sum paradigm of adversarial training, in which predictors are trained against adversarially-chosen perturbations of data. Despite the promise of this approach, algorithms based on this paradigm have not engendered sufficient levels of robustness, and suffer from pathological behavior like robust overfitting. To understand this shortcoming, we first show that the commonly used surrogate-based relaxation used in adversarial training algorithms voids all guarantees on the robustness of trained classifiers. The identification of this pitfall informs a novel non-zero-sum bilevel formulation of adversarial training, wherein each player optimizes a different objective function. Our formulation naturally yields a simple algorithmic framework that matches and in some cases outperforms state-of-the-art attacks, attains comparable levels of robustness to standard adversarial training algorithms, and does not suffer from robust overfitting.

Subjects:	Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
Cite as:	arXiv:2306.11035 [cs.LG]
	(or arXiv:2306.11035v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2306.11035

Submission history

From: Alexander Robey [view email]
[v1] Mon, 19 Jun 2023 16:00:48 UTC (221 KB)
[v2] Mon, 18 Mar 2024 18:55:44 UTC (456 KB)

Computer Science > Machine Learning

Title:Adversarial Training Should Be Cast as a Non-Zero-Sum Game

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Adversarial Training Should Be Cast as a Non-Zero-Sum Game

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators