Stealing and Evading Malware Classifiers and Antivirus at Low False Positive Conditions

Rigaki, Maria; Garcia, Sebastian

doi:10.1016/j.cose.2023.103192

Computer Science > Cryptography and Security

arXiv:2204.06241 (cs)

[Submitted on 13 Apr 2022 (v1), last revised 4 Jun 2023 (this version, v2)]

Title:Stealing and Evading Malware Classifiers and Antivirus at Low False Positive Conditions

Authors:Maria Rigaki, Sebastian Garcia

View PDF

Abstract:Model stealing attacks have been successfully used in many machine learning domains, but there is little understanding of how these attacks work against models that perform malware detection. Malware detection and, in general, security domains have unique conditions. In particular, there are very strong requirements for low false positive rates (FPR). Antivirus products (AVs) that use machine learning are very complex systems to steal, malware binaries continually change, and the whole environment is adversarial by nature. This study evaluates active learning model stealing attacks against publicly available stand-alone machine learning malware classifiers and also against antivirus products. The study proposes a new neural network architecture for surrogate models (dualFFNN) and a new model stealing attack that combines transfer and active learning for surrogate creation (FFNN-TL). We achieved good surrogates of the stand-alone classifiers with up to 99\% agreement with the target models, using less than 4% of the original training dataset. Good surrogates of AV systems were also trained with up to 99% agreement and less than 4,000 queries. The study uses the best surrogates to generate adversarial malware to evade the target models, both stand-alone and AVs (with and without an internet connection). Results show that surrogate models can generate adversarial malware that evades the targets but with a lower success rate than directly using the target models to generate adversarial malware. Using surrogates, however, is still a good option since using the AVs for malware generation is highly time-consuming and easily detected when the AVs are connected to the internet.

Comments:	20 pages, 10 figures, 8 tables. Accepted, please use the DOI/ journal for citations
Subjects:	Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2204.06241 [cs.CR]
	(or arXiv:2204.06241v2 [cs.CR] for this version)
	https://doi.org/10.48550/arXiv.2204.06241
Journal reference:	Computers & Security, Volume 129, June 2023, 103192
Related DOI:	https://doi.org/10.1016/j.cose.2023.103192

Submission history

From: Maria Rigaki [view email]
[v1] Wed, 13 Apr 2022 08:20:41 UTC (5,893 KB)
[v2] Sun, 4 Jun 2023 11:00:50 UTC (5,554 KB)

Computer Science > Cryptography and Security

Title:Stealing and Evading Malware Classifiers and Antivirus at Low False Positive Conditions

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Cryptography and Security

Title:Stealing and Evading Malware Classifiers and Antivirus at Low False Positive Conditions

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators