CatchBackdoor: Backdoor Detection via Critical Trojan Neural Path Fuzzing

Jin, Haibo; Chen, Ruoxi; Chen, Jinyin; Zheng, Haibin; Zhang, Yang; Wang, Haohan

Computer Science > Cryptography and Security

arXiv:2112.13064 (cs)

[Submitted on 24 Dec 2021 (v1), last revised 17 Jul 2024 (this version, v3)]

Title:CatchBackdoor: Backdoor Detection via Critical Trojan Neural Path Fuzzing

Authors:Haibo Jin, Ruoxi Chen, Jinyin Chen, Haibin Zheng, Yang Zhang, Haohan Wang

View PDF HTML (experimental)

Abstract:The success of deep neural networks (DNNs) in real-world applications has benefited from abundant pre-trained models. However, the backdoored pre-trained models can pose a significant trojan threat to the deployment of downstream DNNs. Numerous backdoor detection methods have been proposed but are limited to two aspects: (1) high sensitivity on trigger size, especially on stealthy attacks (i.e., blending attacks and defense adaptive attacks); (2) rely heavily on benign examples for reverse engineering. To address these challenges, we empirically observed that trojaned behaviors triggered by various trojan attacks can be attributed to the trojan path, composed of top-$k$ critical neurons with more significant contributions to model prediction changes. Motivated by it, we propose CatchBackdoor, a detection method against trojan attacks. Based on the close connection between trojaned behaviors and trojan path to trigger errors, CatchBackdoor starts from the benign path and gradually approximates the trojan path through differential fuzzing. We then reverse triggers from the trojan path, to trigger errors caused by diverse trojaned attacks. Extensive experiments on MINST, CIFAR-10, and a-ImageNet datasets and 7 models (LeNet, ResNet, and VGG) demonstrate the superiority of CatchBackdoor over the state-of-the-art methods, in terms of (1) \emph{effective} - it shows better detection performance, especially on stealthy attacks ($\sim$ $\times$ 2 on average); (2) \emph{extensible} - it is robust to trigger size and can conduct detection without benign examples.

Comments:	35 pages
Subjects:	Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2112.13064 [cs.CR]
	(or arXiv:2112.13064v3 [cs.CR] for this version)
	https://doi.org/10.48550/arXiv.2112.13064

Submission history

From: Ruoxi Chen [view email]
[v1] Fri, 24 Dec 2021 13:57:03 UTC (8,989 KB)
[v2] Tue, 21 Feb 2023 14:02:52 UTC (1 KB) (withdrawn)
[v3] Wed, 17 Jul 2024 13:58:13 UTC (4,282 KB)

Computer Science > Cryptography and Security

Title:CatchBackdoor: Backdoor Detection via Critical Trojan Neural Path Fuzzing

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Cryptography and Security

Title:CatchBackdoor: Backdoor Detection via Critical Trojan Neural Path Fuzzing

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators