Posterior Differential Regularization with f-divergence for Improving Model Robustness

Cheng, Hao; Liu, Xiaodong; Pereira, Lis; Yu, Yaoliang; Gao, Jianfeng

Computer Science > Computation and Language

arXiv:2010.12638 (cs)

[Submitted on 23 Oct 2020 (v1), last revised 12 Apr 2021 (this version, v2)]

Title:Posterior Differential Regularization with f-divergence for Improving Model Robustness

Authors:Hao Cheng, Xiaodong Liu, Lis Pereira, Yaoliang Yu, Jianfeng Gao

View PDF

Abstract:We address the problem of enhancing model robustness through regularization. Specifically, we focus on methods that regularize the model posterior difference between clean and noisy inputs. Theoretically, we provide a connection of two recent methods, Jacobian Regularization and Virtual Adversarial Training, under this framework. Additionally, we generalize the posterior differential regularization to the family of $f$-divergences and characterize the overall regularization framework in terms of Jacobian matrix. Empirically, we systematically compare those regularizations and standard BERT training on a diverse set of tasks to provide a comprehensive profile of their effect on model in-domain and out-of-domain generalization. For both fully supervised and semi-supervised settings, our experiments show that regularizing the posterior differential with $f$-divergence can result in well-improved model robustness. In particular, with a proper $f$-divergence, a BERT-base model can achieve comparable generalization as its BERT-large counterpart for in-domain, adversarial and domain shift scenarios, indicating the great potential of the proposed framework for boosting model generalization for NLP models.

Comments:	NAACL 2021
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2010.12638 [cs.CL]
	(or arXiv:2010.12638v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2010.12638

Submission history

From: Hao Cheng [view email]
[v1] Fri, 23 Oct 2020 19:58:01 UTC (622 KB)
[v2] Mon, 12 Apr 2021 17:22:04 UTC (133 KB)

Computer Science > Computation and Language

Title:Posterior Differential Regularization with f-divergence for Improving Model Robustness

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Posterior Differential Regularization with f-divergence for Improving Model Robustness

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators