A Backdoor Attack Scheme with Invisible Triggers Based on Model Architecture Modification

Ma, Yuan; Ma, Xu; Wei, Jiankang; Tang, Jinmeng; Zhang, Xiaoyu; Lyu, Yilun; Chen, Kehao; Huang, Jingtong

Computer Science > Cryptography and Security

arXiv:2412.16905 (cs)

[Submitted on 22 Dec 2024]

Abstract: Machine learning systems are vulnerable to backdoor attacks, in which attackers manipulate model behavior through data tampering or architectural modifications. Traditional backdoor attacks inject malicious samples carrying specific triggers into the training data, causing the model to produce targeted incorrect outputs whenever the corresponding trigger is present. More sophisticated attacks modify the model's architecture directly, embedding backdoors that are harder to detect because they evade traditional data-based detection methods. However, a drawback of architecture-modification-based backdoor attacks is that the trigger must be visible in order to activate the backdoor. To make such attacks less conspicuous, this paper presents a novel backdoor attack method that embeds the backdoor within the model's architecture and can generate inconspicuous, stealthy triggers. The attack is implemented by modifying pre-trained models, which are then redistributed, thereby posing a potential threat to unsuspecting users. Comprehensive experiments on standard computer vision benchmarks validate the effectiveness of the attack and highlight the stealthiness of its triggers, which remain undetectable through both manual visual inspection and advanced detection tools.
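
The contrast the abstract draws between a visible trigger and an inconspicuous one can be made concrete with a simple image-distortion measure. The sketch below is only an illustration and is not the paper's method: the abstract does not say how triggers are generated or how stealthiness is scored. The helpers `add_patch` and `add_faint_noise`, the flat gray test image, and the choice of PSNR as a rough visibility proxy are all assumptions made for this example.

```python
import math
import random


def psnr(clean, modified, max_value=255.0):
    """Peak signal-to-noise ratio in dB between two equal-sized grayscale images."""
    squared_errors = [
        (a - b) ** 2
        for row_a, row_b in zip(clean, modified)
        for a, b in zip(row_a, row_b)
    ]
    mse = sum(squared_errors) / len(squared_errors)
    if mse == 0:
        return math.inf
    return 10.0 * math.log10(max_value ** 2 / mse)


def add_patch(image, size=4, value=255):
    """Hypothetical visible trigger: a bright square in the bottom-right corner."""
    out = [row[:] for row in image]
    for r in range(len(out) - size, len(out)):
        for c in range(len(out[r]) - size, len(out[r])):
            out[r][c] = value
    return out


def add_faint_noise(image, amplitude=2, seed=0):
    """Hypothetical low-amplitude change: each pixel shifts by at most `amplitude`."""
    rng = random.Random(seed)
    return [
        [min(255, max(0, p + rng.randint(-amplitude, amplitude))) for p in row]
        for row in image
    ]


# Assumed test input: a flat mid-gray 32x32 image, so no clipping occurs.
clean = [[128] * 32 for _ in range(32)]

patch_psnr = psnr(clean, add_patch(clean))
noise_psnr = psnr(clean, add_faint_noise(clean))

print(f"visible patch PSNR: {patch_psnr:.1f} dB")
print(f"faint noise PSNR:   {noise_psnr:.1f} dB")
```

A higher PSNR means the modified image is closer to the original, so the faint perturbation scores well above the corner patch. PSNR is a coarse proxy at best; it says nothing about whether a dedicated detection tool would flag the input.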

Subjects:	Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2412.16905 [cs.CR]
	(or arXiv:2412.16905v1 [cs.CR] for this version)
	https://doi.org/10.48550/arXiv.2412.16905

Submission history

From: Yuan Ma
[v1] Sun, 22 Dec 2024 07:39:43 UTC (625 KB)
