On Fragile Features and Batch Normalization in Adversarial Training

Walter, Nils Philipp; Stutz, David; Schiele, Bernt

Computer Science > Machine Learning

arXiv:2204.12393 (cs)

[Submitted on 26 Apr 2022]

Title:On Fragile Features and Batch Normalization in Adversarial Training

Authors:Nils Philipp Walter, David Stutz, Bernt Schiele

View PDF

Abstract:Modern deep learning architecture utilize batch normalization (BN) to stabilize training and improve accuracy. It has been shown that the BN layers alone are surprisingly expressive. In the context of robustness against adversarial examples, however, BN is argued to increase vulnerability. That is, BN helps to learn fragile features. Nevertheless, BN is still used in adversarial training, which is the de-facto standard to learn robust features. In order to shed light on the role of BN in adversarial training, we investigate to what extent the expressiveness of BN can be used to robustify fragile features in comparison to random features. On CIFAR10, we find that adversarially fine-tuning just the BN layers can result in non-trivial adversarial robustness. Adversarially training only the BN layers from scratch, in contrast, is not able to convey meaningful adversarial robustness. Our results indicate that fragile features can be used to learn models with moderate adversarial robustness, while random features cannot

Subjects:	Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
Cite as:	arXiv:2204.12393 [cs.LG]
	(or arXiv:2204.12393v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2204.12393

Submission history

From: Nils Philipp Walter [view email]
[v1] Tue, 26 Apr 2022 15:49:33 UTC (365 KB)

Computer Science > Machine Learning

Title:On Fragile Features and Batch Normalization in Adversarial Training

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:On Fragile Features and Batch Normalization in Adversarial Training

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators