Adversarial Robustness through Dynamic Ensemble Learning

Waghela, Hetvi; Sen, Jaydip; Rakshit, Sneha

arXiv:2412.16254 (cs)

[Submitted on 20 Dec 2024]

Abstract: Adversarial attacks pose a significant threat to the reliability of pre-trained language models (PLMs) such as GPT, BERT, RoBERTa, and T5. This paper presents Adversarial Robustness through Dynamic Ensemble Learning (ARDEL), a novel scheme designed to enhance the robustness of PLMs against such attacks. ARDEL leverages the diversity of multiple PLMs and dynamically adjusts the ensemble configuration based on input characteristics and detected adversarial patterns. Key components of ARDEL include a meta-model for dynamic weighting, an adversarial pattern detection module, and adversarial training with regularization techniques. Comprehensive evaluations using standardized datasets and various adversarial attack scenarios demonstrate that ARDEL significantly improves robustness compared to existing methods. By dynamically reconfiguring the ensemble to prioritize the most robust models for each input, ARDEL effectively reduces attack success rates and maintains higher accuracy under adversarial conditions. This work contributes to the broader goal of developing more secure and trustworthy AI systems for real-world NLP applications, offering a practical and scalable solution to enhance adversarial resilience in PLMs.
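The abstract does not specify how the meta-model computes its weights, so the following is only a minimal sketch of the general idea of input-dependent ensemble weighting, not the authors' implementation. It assumes each model has a hypothetical "clean" score and "robust" score, and that a detector emits an adversarial score in [0, 1]; the names `dynamic_weights`, `ensemble_predict`, and all numeric values are illustrative assumptions.

```python
import math
from typing import List, Sequence


def softmax(scores: Sequence[float]) -> List[float]:
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dynamic_weights(
    clean_scores: Sequence[float],
    robust_scores: Sequence[float],
    adv_score: float,
) -> List[float]:
    """Blend per-model scores by the detector output, then normalize.

    adv_score is an assumed detector output in [0, 1]: 0 means the input
    looks clean, 1 means it looks adversarial. A fixed linear blend stands
    in for the learned meta-model described in the abstract.
    """
    blended = [
        (1.0 - adv_score) * c + adv_score * r
        for c, r in zip(clean_scores, robust_scores)
    ]
    return softmax(blended)


def ensemble_predict(
    model_probs: Sequence[Sequence[float]], weights: Sequence[float]
) -> List[float]:
    """Weighted average of per-model class probability vectors."""
    n_classes = len(model_probs[0])
    return [
        sum(w * probs[k] for w, probs in zip(weights, model_probs))
        for k in range(n_classes)
    ]


# Hypothetical outputs of three models on one two-class input.
model_probs = [[0.9, 0.1], [0.4, 0.6], [0.2, 0.8]]
clean_scores = [2.0, 1.0, 0.5]   # assumed: model 0 is best on clean inputs
robust_scores = [0.2, 1.0, 2.0]  # assumed: model 2 is most robust

weights_clean = dynamic_weights(clean_scores, robust_scores, adv_score=0.0)
weights_adv = dynamic_weights(clean_scores, robust_scores, adv_score=1.0)
pred_clean = ensemble_predict(model_probs, weights_clean)
pred_adv = ensemble_predict(model_probs, weights_adv)
```

In this toy setup the ensemble leans on model 0 when the detector reports a clean input and shifts toward model 2 when it reports an adversarial one, which is the kind of per-input reconfiguration the abstract describes. A learned meta-model would replace the fixed linear blend.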

Comments:	This is the accepted version of our paper for the 2024 IEEE Silchar Subsection Conference (IEEE SILCON24), held from November 15 to 17, 2024, at the National Institute of Technology (NIT), Agartala, India. The paper is 6 pages long and contains 3 figures and 7 tables.
Subjects:	Cryptography and Security (cs.CR); Computation and Language (cs.CL)
Cite as:	arXiv:2412.16254 [cs.CR]
	(or arXiv:2412.16254v1 [cs.CR] for this version)
	https://doi.org/10.48550/arXiv.2412.16254

Submission history

From: Jaydip Sen Prof.
[v1] Fri, 20 Dec 2024 05:36:19 UTC (336 KB)
