ProTransformer: Robustify Transformers via Plug-and-Play Paradigm

Hou, Zhichao; Gao, Weizhi; Shen, Yuchen; Wang, Feiyi; Liu, Xiaorui

Computer Science > Machine Learning

arXiv:2410.23182 (cs)

[Submitted on 30 Oct 2024]

Title:ProTransformer: Robustify Transformers via Plug-and-Play Paradigm

Authors:Zhichao Hou, Weizhi Gao, Yuchen Shen, Feiyi Wang, Xiaorui Liu

View PDF HTML (experimental)

Abstract:Transformer-based architectures have dominated various areas of machine learning in recent years. In this paper, we introduce a novel robust attention mechanism designed to enhance the resilience of transformer-based architectures. Crucially, this technique can be integrated into existing transformers as a plug-and-play layer, improving their robustness without the need for additional training or fine-tuning. Through comprehensive experiments and ablation studies, we demonstrate that our ProTransformer significantly enhances the robustness of transformer models across a variety of prediction tasks, attack mechanisms, backbone architectures, and data domains. Notably, without further fine-tuning, the ProTransformer consistently improves the performance of vanilla transformers by 19.5%, 28.3%, 16.1%, and 11.4% for BERT, ALBERT, DistilBERT, and RoBERTa, respectively, under the classical TextFooler attack. Furthermore, ProTransformer shows promising resilience in large language models (LLMs) against prompting-based attacks, improving the performance of T5 and LLaMA by 24.8% and 17.8%, respectively, and enhancing Vicuna by an average of 10.4% against the Jailbreaking attack. Beyond the language domain, ProTransformer also demonstrates outstanding robustness in both vision and graph domains.

Subjects:	Machine Learning (cs.LG); Computation and Language (cs.CL); Cryptography and Security (cs.CR)
Cite as:	arXiv:2410.23182 [cs.LG]
	(or arXiv:2410.23182v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2410.23182

Submission history

From: Zhichao Hou [view email]
[v1] Wed, 30 Oct 2024 16:38:09 UTC (8,383 KB)

Computer Science > Machine Learning

Title:ProTransformer: Robustify Transformers via Plug-and-Play Paradigm

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:ProTransformer: Robustify Transformers via Plug-and-Play Paradigm

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators