Pseudo-OOD training for robust language models

Sundararaman, Dhanasekar; Mehta, Nikhil; Carin, Lawrence

Computer Science > Computation and Language

arXiv:2210.09132 (cs)

[Submitted on 17 Oct 2022]

Title:Pseudo-OOD training for robust language models

Authors:Dhanasekar Sundararaman, Nikhil Mehta, Lawrence Carin

View PDF

Abstract:While pre-trained large-scale deep models have garnered attention as an important topic for many downstream natural language processing (NLP) tasks, such models often make unreliable predictions on out-of-distribution (OOD) inputs. As such, OOD detection is a key component of a reliable machine-learning model for any industry-scale application. Common approaches often assume access to additional OOD samples during the training stage, however, outlier distribution is often unknown in advance. Instead, we propose a post hoc framework called POORE - POsthoc pseudo-Ood REgularization, that generates pseudo-OOD samples using in-distribution (IND) data. The model is fine-tuned by introducing a new regularization loss that separates the embeddings of IND and OOD data, which leads to significant gains on the OOD prediction task during testing. We extensively evaluate our framework on three real-world dialogue systems, achieving new state-of-the-art in OOD detection.

Comments:	Work in progress
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2210.09132 [cs.CL]
	(or arXiv:2210.09132v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2210.09132

Submission history

From: Dhanasekar Sundararaman [view email]
[v1] Mon, 17 Oct 2022 14:32:02 UTC (128 KB)

Computer Science > Computation and Language

Title:Pseudo-OOD training for robust language models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Pseudo-OOD training for robust language models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators