Trigger Hunting with a Topological Prior for Trojan Detection

Hu, Xiaoling; Lin, Xiao; Cogswell, Michael; Yao, Yi; Jha, Susmit; Chen, Chao

arXiv:2110.08335 (cs)

[Submitted on 15 Oct 2021 (v1), last revised 2 Apr 2022 (this version, v2)]

Abstract: Despite their success and popularity, deep neural networks (DNNs) are vulnerable to backdoor attacks. This impedes their wider adoption, especially in mission-critical applications. This paper tackles the problem of Trojan detection, that is, identifying Trojaned models (models trained with poisoned data). One popular approach is reverse engineering, i.e., recovering the triggers on a clean image by manipulating the model's prediction. One major challenge of the reverse engineering approach is the enormous search space of triggers. To this end, we propose innovative priors such as diversity and topological simplicity, which not only increase the chances of finding the appropriate triggers but also improve the quality of the triggers found. Moreover, by encouraging a diverse set of trigger candidates, our method can perform effectively in cases with unknown target labels. We demonstrate that these priors can significantly improve the quality of the recovered triggers, resulting in substantially improved Trojan detection accuracy as validated on both synthetic and publicly available TrojAI benchmarks.
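
To make the idea of a "topologically simple" trigger concrete, the following is a minimal illustrative sketch and not the paper's method. It assumes a trigger candidate is a mask plus a pattern blended into a clean image as x' = (1 - m) * x + m * p, which is a common convention in reverse-engineering work but is not stated in the abstract. It uses the number of connected components of the thresholded mask as a crude stand-in for topological simplicity. The function names `apply_trigger` and `count_components` and the toy masks are hypothetical.

```python
from collections import deque


def apply_trigger(image, mask, pattern):
    """Blend a candidate trigger into a clean image: x' = (1 - m) * x + m * p.

    All three arguments are 2D lists of the same shape (assumed convention).
    """
    return [
        [(1.0 - m) * x + m * p for x, m, p in zip(img_row, mask_row, pat_row)]
        for img_row, mask_row, pat_row in zip(image, mask, pattern)
    ]


def count_components(mask, threshold=0.5):
    """Count 4-connected components among pixels where mask > threshold.

    Fewer components is used here as a rough proxy for a simpler topology.
    """
    rows, cols = len(mask), len(mask[0])
    seen = [[False] * cols for _ in range(rows)]
    components = 0
    for r in range(rows):
        for c in range(cols):
            if mask[r][c] <= threshold or seen[r][c]:
                continue
            # New component found: flood-fill it with breadth-first search.
            components += 1
            seen[r][c] = True
            queue = deque([(r, c)])
            while queue:
                y, x = queue.popleft()
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < rows and 0 <= nx < cols
                            and not seen[ny][nx] and mask[ny][nx] > threshold):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
    return components


# Toy masks (hypothetical): one compact patch versus scattered pixels.
compact_mask = [
    [0, 0, 0, 0],
    [0, 1, 1, 0],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
]
scattered_mask = [
    [1, 0, 0, 1],
    [0, 0, 0, 0],
    [0, 0, 1, 0],
    [1, 0, 0, 0],
]

clean_image = [[0.2] * 4 for _ in range(4)]
white_pattern = [[1.0] * 4 for _ in range(4)]
stamped = apply_trigger(clean_image, compact_mask, white_pattern)

n_compact = count_components(compact_mask)
n_scattered = count_components(scattered_mask)
```

Under this proxy, a search that penalizes the component count would prefer the compact mask over the scattered one. A differentiable topological penalty suitable for gradient-based trigger search would need a different construction, which the abstract does not describe.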

Comments:	17 pages, 10 figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Computational Geometry (cs.CG)
Cite as:	arXiv:2110.08335 [cs.CV]
	(or arXiv:2110.08335v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2110.08335

Submission history

From: Xiaoling Hu Mr
[v1] Fri, 15 Oct 2021 19:47:00 UTC (5,469 KB)
[v2] Sat, 2 Apr 2022 04:36:03 UTC (5,746 KB)
