Effort: Efficient Orthogonal Modeling for Generalizable AI-Generated Image Detection

Yan, Zhiyuan; Wang, Jiangming; Wang, Zhendong; Jin, Peng; Zhang, Ke-Yue; Chen, Shen; Yao, Taiping; Ding, Shouhong; Wu, Baoyuan; Yuan, Li

Computer Science > Computer Vision and Pattern Recognition

arXiv:2411.15633 (cs)

[Submitted on 23 Nov 2024]

Title:Effort: Efficient Orthogonal Modeling for Generalizable AI-Generated Image Detection

Authors:Zhiyuan Yan, Jiangming Wang, Zhendong Wang, Peng Jin, Ke-Yue Zhang, Shen Chen, Taiping Yao, Shouhong Ding, Baoyuan Wu, Li Yuan

View PDF HTML (experimental)

Abstract:Existing AI-generated image (AIGI) detection methods often suffer from limited generalization performance. In this paper, we identify a crucial yet previously overlooked asymmetry phenomenon in AIGI detection: during training, models tend to quickly overfit to specific fake patterns in the training set, while other information is not adequately captured, leading to poor generalization when faced with new fake methods. A key insight is to incorporate the rich semantic knowledge embedded within large-scale vision foundation models (VFMs) to expand the previous discriminative space (based on forgery patterns only), such that the discrimination is decided by both forgery and semantic cues, thereby reducing the overfitting to specific forgery patterns. A straightforward solution is to fully fine-tune VFMs, but it risks distorting the well-learned semantic knowledge, pushing the model back toward overfitting. To this end, we design a novel approach called Effort: Efficient orthogonal modeling for generalizable AIGI detection. Specifically, we employ Singular Value Decomposition (SVD) to construct the orthogonal semantic and forgery subspaces. By freezing the principal components and adapting the residual components ($\sim$0.19M parameters), we preserve the original semantic subspace and use its orthogonal subspace for learning forgeries. Extensive experiments on AIGI detection benchmarks demonstrate the superior effectiveness of our approach.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2411.15633 [cs.CV]
	(or arXiv:2411.15633v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2411.15633

Submission history

From: Zhiyuan Yan [view email]
[v1] Sat, 23 Nov 2024 19:10:32 UTC (12,702 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Effort: Efficient Orthogonal Modeling for Generalizable AI-Generated Image Detection

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Effort: Efficient Orthogonal Modeling for Generalizable AI-Generated Image Detection

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators