Post-train Black-box Defense via Bayesian Boundary Correction

Wang, He; Diao, Yunfeng

arXiv:2306.16979v3 (cs)

[Submitted on 29 Jun 2023 (v1), last revised 11 Jun 2024 (this version, v3)]

Abstract: Classifiers based on deep neural networks are susceptible to adversarial attacks, and this widespread vulnerability has motivated research on defending them from potential threats. Given a vulnerable classifier, existing defense methods are mostly white-box and often require re-training the victim under modified loss functions or training regimes. However, the model, data, and training specifics of the victim are usually unavailable to the user, and re-training is unappealing, if not impossible, for reasons such as limited computational resources. To this end, we propose a new post-train black-box defense framework. It can turn any pre-trained classifier into a resilient one with little knowledge of the model specifics. This is achieved by new joint Bayesian treatments of the clean data, the adversarial examples, and the classifier, which maximize their joint probability. It is further equipped with a new post-train strategy that keeps the victim intact and avoids re-training. We name our framework Bayesian Boundary Correction (BBC). BBC is a general and flexible framework that can easily adapt to different data types. We instantiate BBC for image classification and skeleton-based human activity recognition, covering both static and dynamic data. Exhaustive evaluation shows that, compared with existing defense methods, BBC achieves superior robustness and enhances robustness without severely hurting clean accuracy.
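
To make the "post-train, victim kept intact" idea concrete, the following is a minimal sketch and not the authors' BBC method. It assumes a hypothetical frozen two-class victim (`victim_logits`) that can only be queried, and it trains a small hypothetical head (`LogitCorrector`) on the victim's outputs. The Bayesian treatment described in the abstract is not modeled here, and bounded random perturbations stand in for real adversarial examples. All names, data, and hyperparameters are illustrative assumptions.

```python
import math
import random


def victim_logits(x):
    # Hypothetical frozen black-box victim: a fixed 2-class scorer on 2-D inputs.
    # The defense only calls it; it never reads or updates its internals.
    d = x[0] - x[1]
    return [d, -d]


def softmax(z):
    m = max(z)
    e = [math.exp(v - m) for v in z]
    s = sum(e)
    return [v / s for v in e]


class LogitCorrector:
    """Trainable head on the victim's logits: z' = z + W z + b (identity at init)."""

    def __init__(self, k):
        self.k = k
        self.W = [[0.0] * k for _ in range(k)]
        self.b = [0.0] * k

    def forward(self, z):
        return [
            z[i] + sum(self.W[i][j] * z[j] for j in range(self.k)) + self.b[i]
            for i in range(self.k)
        ]


def mean_loss(head, data):
    # Mean cross-entropy of the corrected predictions.
    total = 0.0
    for x, y in data:
        total -= math.log(softmax(head.forward(victim_logits(x)))[y])
    return total / len(data)


def train(head, data, lr=0.1, epochs=200):
    # Full-batch gradient descent on the head only; the victim is never updated.
    n, k = len(data), head.k
    for _ in range(epochs):
        gW = [[0.0] * k for _ in range(k)]
        gb = [0.0] * k
        for x, y in data:
            z = victim_logits(x)  # black-box query
            p = softmax(head.forward(z))
            for i in range(k):
                g = (p[i] - (1.0 if i == y else 0.0)) / n
                gb[i] += g
                for j in range(k):
                    gW[i][j] += g * z[j]
        for i in range(k):
            head.b[i] -= lr * gb[i]
            for j in range(k):
                head.W[i][j] -= lr * gW[i][j]


def make_data(rng, n=100, eps=0.3):
    data = []
    for _ in range(n):
        y = rng.randrange(2)
        x = [
            (1.0 if y == 0 else 0.0) + rng.uniform(-0.3, 0.3),
            (1.0 if y == 1 else 0.0) + rng.uniform(-0.3, 0.3),
        ]
        data.append((x, y))  # clean sample
        # Assumption: a bounded random perturbation stands in for an adversarial example.
        data.append(([v + rng.uniform(-eps, eps) for v in x], y))
    return data


rng = random.Random(0)
data = make_data(rng)
head = LogitCorrector(2)
loss_before = mean_loss(head, data)
train(head, data)
loss_after = mean_loss(head, data)
print(f"mean loss before: {loss_before:.3f}, after: {loss_after:.3f}")
```

Because the head starts as the identity map, the wrapped model initially reproduces the victim's predictions exactly, and training changes only the head's parameters. A real defense would replace the random perturbations with adversarial examples.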

Comments:	arXiv admin note: text overlap with arXiv:2203.04713
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
Cite as:	arXiv:2306.16979 [cs.CV]
	(or arXiv:2306.16979v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2306.16979

Submission history

From: Yunfeng Diao
[v1] Thu, 29 Jun 2023 14:33:20 UTC (7,671 KB)
[v2] Tue, 17 Oct 2023 02:06:18 UTC (7,671 KB)
[v3] Tue, 11 Jun 2024 07:14:18 UTC (8,910 KB)