Classifying Deepfakes Using Swin Transformers

Xi, Aprille J.; Chen, Eason

Computer Science > Computer Vision and Pattern Recognition

arXiv:2501.15656 (cs)

[Submitted on 26 Jan 2025 (v1), last revised 31 Jan 2025 (this version, v2)]

Title:Classifying Deepfakes Using Swin Transformers

Authors:Aprille J. Xi, Eason Chen

View PDF HTML (experimental)

Abstract:The proliferation of deepfake technology poses significant challenges to the authenticity and trustworthiness of digital media, necessitating the development of robust detection methods. This study explores the application of Swin Transformers, a state-of-the-art architecture leveraging shifted windows for self-attention, in detecting and classifying deepfake images. Using the Real and Fake Face Detection dataset by Yonsei University's Computational Intelligence Photography Lab, we evaluate the Swin Transformer and hybrid models such as Swin-ResNet and Swin-KNN, focusing on their ability to identify subtle manipulation artifacts. Our results demonstrate that the Swin Transformer outperforms conventional CNN-based architectures, including VGG16, ResNet18, and AlexNet, achieving a test accuracy of 71.29%. Additionally, we present insights into hybrid model design, highlighting the complementary strengths of transformer and CNN-based approaches in deepfake detection. This study underscores the potential of transformer-based architectures for improving accuracy and generalizability in image-based manipulation detection, paving the way for more effective countermeasures against deepfake threats.

Comments:	3 pages
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2501.15656 [cs.CV]
	(or arXiv:2501.15656v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2501.15656

Submission history

From: Eason Chen [view email]
[v1] Sun, 26 Jan 2025 19:35:46 UTC (952 KB)
[v2] Fri, 31 Jan 2025 16:16:30 UTC (952 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Classifying Deepfakes Using Swin Transformers

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Classifying Deepfakes Using Swin Transformers

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators