Achieving Robustness in the Wild via Adversarial Mixing with Disentangled Representations

Gowal, Sven; Qin, Chongli; Huang, Po-Sen; Cemgil, Taylan; Dvijotham, Krishnamurthy; Mann, Timothy; Kohli, Pushmeet

Computer Science > Machine Learning

arXiv:1912.03192 (cs)

[Submitted on 6 Dec 2019 (v1), last revised 25 Mar 2020 (this version, v2)]

Title:Achieving Robustness in the Wild via Adversarial Mixing with Disentangled Representations

Authors:Sven Gowal, Chongli Qin, Po-Sen Huang, Taylan Cemgil, Krishnamurthy Dvijotham, Timothy Mann, Pushmeet Kohli

View PDF

Abstract:Recent research has made the surprising finding that state-of-the-art deep learning models sometimes fail to generalize to small variations of the input. Adversarial training has been shown to be an effective approach to overcome this problem. However, its application has been limited to enforcing invariance to analytically defined transformations like $\ell_p$-norm bounded perturbations. Such perturbations do not necessarily cover plausible real-world variations that preserve the semantics of the input (such as a change in lighting conditions). In this paper, we propose a novel approach to express and formalize robustness to these kinds of real-world transformations of the input. The two key ideas underlying our formulation are (1) leveraging disentangled representations of the input to define different factors of variations, and (2) generating new input images by adversarially composing the representations of different images. We use a StyleGAN model to demonstrate the efficacy of this framework. Specifically, we leverage the disentangled latent representations computed by a StyleGAN model to generate perturbations of an image that are similar to real-world variations (like adding make-up, or changing the skin-tone of a person) and train models to be invariant to these perturbations. Extensive experiments show that our method improves generalization and reduces the effect of spurious correlations (reducing the error rate of a "smile" detector by 21% for example).

Comments:	Accepted at CVPR 2020
Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
Cite as:	arXiv:1912.03192 [cs.LG]
	(or arXiv:1912.03192v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1912.03192

Submission history

From: Sven Gowal [view email]
[v1] Fri, 6 Dec 2019 15:56:53 UTC (4,146 KB)
[v2] Wed, 25 Mar 2020 09:33:57 UTC (4,820 KB)

Computer Science > Machine Learning

Title:Achieving Robustness in the Wild via Adversarial Mixing with Disentangled Representations

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Achieving Robustness in the Wild via Adversarial Mixing with Disentangled Representations

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators