UNIT: Backdoor Mitigation via Automated Neural Distribution Tightening

Cheng, Siyuan; Shen, Guangyu; Zhang, Kaiyuan; Tao, Guanhong; An, Shengwei; Guo, Hanxi; Ma, Shiqing; Zhang, Xiangyu

arXiv:2407.11372 (cs)

[Submitted on 16 Jul 2024]

Abstract: Deep neural networks (DNNs) have demonstrated effectiveness in various fields. However, DNNs are vulnerable to backdoor attacks, which inject a unique pattern, called a trigger, into the input to cause misclassification to an attacker-chosen target label. While existing works have proposed various methods to mitigate backdoor effects in poisoned models, they tend to be less effective against recent advanced attacks. In this paper, we introduce UNIT, a novel post-training defense technique that can effectively eliminate backdoor effects for a variety of attacks. Specifically, UNIT approximates a unique and tight activation distribution for each neuron in the model. It then proactively dispels substantially large activation values that exceed the approximated boundaries. Our experimental results demonstrate that UNIT outperforms 7 popular defense methods against 14 existing backdoor attacks, including 2 advanced attacks, using only 5% of clean training data. UNIT is also cost-efficient. The code is accessible at this https URL.
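
The core idea in the abstract (a per-neuron activation boundary, with values above it suppressed) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say how UNIT approximates the boundaries, so the quantile rule used here is an assumption made purely for illustration, and the names `estimate_bounds` and `tighten` are hypothetical.

```python
import math
from typing import List, Sequence


def estimate_bounds(clean_acts: Sequence[Sequence[float]],
                    quantile: float = 0.95) -> List[float]:
    """Return one upper bound per neuron from activations on clean samples.

    clean_acts[i][j] is the activation of neuron j on clean sample i.
    ASSUMPTION: the bound is a nearest-rank quantile of the clean
    activations. The abstract does not specify UNIT's actual procedure.
    """
    if not clean_acts:
        raise ValueError("need at least one clean sample")
    if not 0.0 < quantile <= 1.0:
        raise ValueError("quantile must be in (0, 1]")
    n_samples = len(clean_acts)
    n_neurons = len(clean_acts[0])
    # Nearest-rank index: smallest rank covering `quantile` of the samples.
    rank = max(1, math.ceil(quantile * n_samples))
    bounds = []
    for j in range(n_neurons):
        column = sorted(row[j] for row in clean_acts)
        bounds.append(column[rank - 1])
    return bounds


def tighten(acts: Sequence[float], bounds: Sequence[float]) -> List[float]:
    """Clip each neuron's activation to its own upper bound."""
    if len(acts) != len(bounds):
        raise ValueError("acts and bounds must have the same length")
    return [min(a, b) for a, b in zip(acts, bounds)]


if __name__ == "__main__":
    # Four clean samples, two neurons (toy numbers, not experimental data).
    clean = [[0.1, 1.0], [0.2, 2.0], [0.3, 3.0], [0.4, 4.0]]
    bounds = estimate_bounds(clean, quantile=0.75)
    # Neuron 0 fires far above its bound and is clipped; neuron 1 is untouched.
    print(bounds, tighten([9.0, 2.5], bounds))
```

In a real network the clipping would be applied to each layer's activations during the forward pass, and a single fixed quantile would likely need to be replaced by per-neuron bounds tuned so that clean accuracy is preserved.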

Comments:	The 18th European Conference on Computer Vision (ECCV 2024)
Subjects:	Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2407.11372 [cs.CR]
	(or arXiv:2407.11372v1 [cs.CR] for this version)
	https://doi.org/10.48550/arXiv.2407.11372

Submission history

From: Siyuan Cheng
[v1] Tue, 16 Jul 2024 04:33:05 UTC (2,965 KB)
