Effect of Ambient-Intrinsic Dimension Gap on Adversarial Vulnerability

Haldar, Rajdeep; Xing, Yue; Song, Qifan

arXiv:2403.03967 (cs)

[Submitted on 6 Mar 2024 (v1), last revised 23 Mar 2024 (this version, v2)]

Abstract: The existence of adversarial attacks on machine learning models that are imperceptible to a human is still quite a mystery from a theoretical perspective. In this work, we introduce two notions of adversarial attacks: natural or on-manifold attacks, which are perceptible by a human/oracle, and unnatural or off-manifold attacks, which are not. We argue that the existence of the off-manifold attacks is a natural consequence of the dimension gap between the intrinsic and ambient dimensions of the data. For 2-layer ReLU networks, we prove that even though the dimension gap does not affect generalization performance on samples drawn from the observed data space, it makes the clean-trained model more vulnerable to adversarial perturbations in the off-manifold direction of the data space. Our main results provide an explicit relationship between the $\ell_2$ and $\ell_{\infty}$ attack strengths of the on/off-manifold attacks and the dimension gap.
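The on/off-manifold distinction in the abstract can be illustrated with a small sketch. It assumes, purely for illustration, that the data manifold is a linear subspace of the ambient space with a known orthonormal basis; the function name `split_perturbation` and the dimensions `D = 50`, `d = 5` are hypothetical and are not taken from the paper. Under that assumption, any perturbation splits into a component inside the subspace (on-manifold) and a component in its orthogonal complement (off-manifold).

```python
import numpy as np

def split_perturbation(delta, basis):
    """Split a perturbation into on-manifold and off-manifold parts.

    basis: (D, d) array with orthonormal columns spanning the assumed
    linear data subspace (an illustrative stand-in for the data manifold).
    """
    on = basis @ (basis.T @ delta)   # orthogonal projection onto the subspace
    off = delta - on                 # remainder lies in the orthogonal complement
    return on, off

rng = np.random.default_rng(0)
D, d = 50, 5                         # ambient and intrinsic dimensions (hypothetical values)

# Reduced QR of a random (D, d) matrix yields an orthonormal basis of a d-dim subspace.
basis, _ = np.linalg.qr(rng.standard_normal((D, d)))

delta = rng.standard_normal(D)       # an arbitrary perturbation in the ambient space
on, off = split_perturbation(delta, basis)

# l2 and l_inf sizes of each component, the two norms mentioned in the abstract.
print("on-manifold :", np.linalg.norm(on), np.abs(on).max())
print("off-manifold:", np.linalg.norm(off), np.abs(off).max())
```

With `D - d` off-manifold directions and only `d` on-manifold ones, a generic perturbation tends to carry most of its norm off the subspace when the dimension gap is large. This sketch shows only the decomposition; it does not reproduce the paper's results for 2-layer ReLU networks.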

Comments:	AISTATS 2024
Subjects:	Machine Learning (cs.LG); Cryptography and Security (cs.CR); Machine Learning (stat.ML)
Cite as:	arXiv:2403.03967 [cs.LG]
	(or arXiv:2403.03967v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2403.03967

Submission history

From: Rajdeep Haldar
[v1] Wed, 6 Mar 2024 15:41:21 UTC (893 KB)
[v2] Sat, 23 Mar 2024 11:22:00 UTC (893 KB)