Adversarial Machine Learning: Attacking and Safeguarding Image Datasets

Chowdhury, Koushik

Computer Science > Machine Learning

arXiv:2502.05203 (cs)

[Submitted on 31 Jan 2025]

Title:Adversarial Machine Learning: Attacking and Safeguarding Image Datasets

Authors:Koushik Chowdhury

View PDF HTML (experimental)

Abstract:This paper examines the vulnerabilities of convolutional neural networks (CNNs) to adversarial attacks and explores a method for their safeguarding. In this study, CNNs were implemented on four of the most common image datasets, namely CIFAR-10, ImageNet, MNIST, and Fashion-MNIST, and achieved high baseline accuracy. To assess the strength of these models, the Fast Gradient Sign Method was used, which is a type of exploit on the model that is used to bring down the models accuracies by adding a very minimal perturbation to the input image. To counter the FGSM attack, a safeguarding approach went through, which includes retraining the models on clear and pollutant or adversarial images to increase their resistance ability. The next step involves applying FGSM again, but this time to the adversarially trained models, to see how much the accuracy of the models has gone down and evaluate the effectiveness of the defense. It appears that while most level of robustness is achieved against the models after adversarial training, there are still a few losses in the performance of these models against adversarial perturbations. This work emphasizes the need to create better defenses for models deployed in real-world scenarios against adversaries.

Comments:	6 pages, published in Proceedings of the Fourth International Conference on Ubiquitous Computing and Intelligent Information Systems (ICUIS-2024)
Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2502.05203 [cs.LG]
	(or arXiv:2502.05203v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2502.05203
Journal reference:	Proceedings of the Fourth International Conference on Ubiquitous Computing and Intelligent Information Systems (ICUIS-2024)

Submission history

From: Koushik Chowdhury [view email]
[v1] Fri, 31 Jan 2025 22:32:38 UTC (234 KB)

Computer Science > Machine Learning

Title:Adversarial Machine Learning: Attacking and Safeguarding Image Datasets

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Adversarial Machine Learning: Attacking and Safeguarding Image Datasets

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators