Exploring the Adversarial Robustness of CLIP for AI-generated Image Detection

De Rosa, Vincenzo; Guillaro, Fabrizio; Poggi, Giovanni; Cozzolino, Davide; Verdoliva, Luisa

Computer Science > Computer Vision and Pattern Recognition

arXiv:2407.19553 (cs)

[Submitted on 28 Jul 2024 (v1), last revised 23 Oct 2024 (this version, v2)]

Title:Exploring the Adversarial Robustness of CLIP for AI-generated Image Detection

Authors:Vincenzo De Rosa, Fabrizio Guillaro, Giovanni Poggi, Davide Cozzolino, Luisa Verdoliva

View PDF HTML (experimental)

Abstract:In recent years, many forensic detectors have been proposed to detect AI-generated images and prevent their use for malicious purposes. Convolutional neural networks (CNNs) have long been the dominant architecture in this field and have been the subject of intense study. However, recently proposed Transformer-based detectors have been shown to match or even outperform CNN-based detectors, especially in terms of generalization. In this paper, we study the adversarial robustness of AI-generated image detectors, focusing on Contrastive Language-Image Pretraining (CLIP)-based methods that rely on Visual Transformer (ViT) backbones and comparing their performance with CNN-based methods. We study the robustness to different adversarial attacks under a variety of conditions and analyze both numerical results and frequency-domain patterns. CLIP-based detectors are found to be vulnerable to white-box attacks just like CNN-based detectors. However, attacks do not easily transfer between CNN-based and CLIP-based methods. This is also confirmed by the different distribution of the adversarial noise patterns in the frequency domain. Overall, this analysis provides new insights into the properties of forensic detectors that can help to develop more effective strategies.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2407.19553 [cs.CV]
	(or arXiv:2407.19553v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2407.19553

Submission history

From: Davide Cozzolino [view email]
[v1] Sun, 28 Jul 2024 18:20:08 UTC (3,436 KB)
[v2] Wed, 23 Oct 2024 16:06:32 UTC (2,954 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Exploring the Adversarial Robustness of CLIP for AI-generated Image Detection

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Exploring the Adversarial Robustness of CLIP for AI-generated Image Detection

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators