Raising the Bar of AI-generated Image Detection with CLIP

Cozzolino, Davide; Poggi, Giovanni; Corvi, Riccardo; Nießner, Matthias; Verdoliva, Luisa

arXiv:2312.00195 (cs)

[Submitted on 30 Nov 2023 (v1), last revised 29 Apr 2024 (this version, v2)]

Abstract: The aim of this work is to explore the potential of pre-trained vision-language models (VLMs) for universal detection of AI-generated images. We develop a lightweight detection strategy based on CLIP features and study its performance in a wide variety of challenging scenarios. We find that, contrary to previous beliefs, it is neither necessary nor convenient to use a large domain-specific dataset for training. On the contrary, by using only a handful of example images from a single generative model, a CLIP-based detector exhibits surprising generalization ability and high robustness across different architectures, including recent commercial tools such as Dalle-3, Midjourney v5, and Firefly. We match the state-of-the-art (SoTA) on in-distribution data and significantly improve upon it in terms of generalization to out-of-distribution data (+6% AUC) and robustness to impaired/laundered data (+13%). Our project is available at this https URL
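
As an illustration only, the following sketch shows the general shape of a few-shot, feature-based detector evaluated by AUC. It is not the authors' implementation: the abstract does not specify the classifier, so a plain logistic regression is assumed here, and synthetic Gaussian vectors stand in for CLIP image features so that the example runs without any model download. All function names (`make_features`, `train_linear`, `auc`, `run_demo`) and all parameter values are hypothetical.

```python
import math
import random


def make_features(n, dim, shift, rng):
    """Synthetic stand-in for image embeddings (assumption: NOT real CLIP features)."""
    return [[rng.gauss(shift, 1.0) for _ in range(dim)] for _ in range(n)]


def l2_normalize(v):
    """Scale a vector to unit length, as is commonly done with embedding vectors."""
    norm = math.sqrt(sum(x * x for x in v)) or 1.0
    return [x / norm for x in v]


def score(w, b, x):
    """Linear decision score; higher means 'more likely generated'."""
    return sum(wi * xi for wi, xi in zip(w, x)) + b


def train_linear(real, fake, epochs=200, lr=0.5):
    """Logistic regression by batch gradient descent (label 1 = generated)."""
    data = [(x, 0.0) for x in real] + [(x, 1.0) for x in fake]
    dim = len(data[0][0])
    w, b = [0.0] * dim, 0.0
    for _ in range(epochs):
        gw, gb = [0.0] * dim, 0.0
        for x, y in data:
            p = 1.0 / (1.0 + math.exp(-score(w, b, x)))
            err = p - y
            for i in range(dim):
                gw[i] += err * x[i]
            gb += err
        n = len(data)
        w = [wi - lr * gi / n for wi, gi in zip(w, gw)]
        b -= lr * gb / n
    return w, b


def auc(scores_real, scores_fake):
    """Probability that a random generated sample outscores a random real one."""
    wins = 0.0
    for sf in scores_fake:
        for sr in scores_real:
            if sf > sr:
                wins += 1.0
            elif sf == sr:
                wins += 0.5  # ties count as half a win
    return wins / (len(scores_fake) * len(scores_real))


def run_demo(seed=0, dim=32, n_train=10, n_test=200):
    """Train on a handful of examples per class, then report AUC on held-out data."""
    rng = random.Random(seed)
    # Hypothetical setup: 'generated' features differ from 'real' by a mean shift.
    train_real = [l2_normalize(v) for v in make_features(n_train, dim, 0.0, rng)]
    train_fake = [l2_normalize(v) for v in make_features(n_train, dim, 1.0, rng)]
    test_real = [l2_normalize(v) for v in make_features(n_test, dim, 0.0, rng)]
    test_fake = [l2_normalize(v) for v in make_features(n_test, dim, 1.0, rng)]
    w, b = train_linear(train_real, train_fake)
    return auc([score(w, b, x) for x in test_real],
               [score(w, b, x) for x in test_fake])


if __name__ == "__main__":
    print(f"held-out AUC: {run_demo():.3f}")
```

In a real pipeline the synthetic vectors would be replaced by features extracted from a pre-trained CLIP image encoder, and the held-out set would come from generators not seen during training. The numbers this toy prints say nothing about the results reported in the paper.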

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2312.00195 [cs.CV]
	(or arXiv:2312.00195v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2312.00195

Submission history

From: Davide Cozzolino
[v1] Thu, 30 Nov 2023 21:11:20 UTC (17,175 KB)
[v2] Mon, 29 Apr 2024 14:25:42 UTC (16,449 KB)
