Data Augmentation for Intent Classification

Chen, Derek; Yin, Claire

Computer Science > Computation and Language

arXiv:2206.05790 (cs)

[Submitted on 12 Jun 2022]

Title:Data Augmentation for Intent Classification

Authors:Derek Chen, Claire Yin

View PDF

Abstract:Training accurate intent classifiers requires labeled data, which can be costly to obtain. Data augmentation methods may ameliorate this issue, but the quality of the generated data varies significantly across techniques. We study the process of systematically producing pseudo-labeled data given a small seed set using a wide variety of data augmentation techniques, including mixing methods together. We find that while certain methods dramatically improve qualitative and quantitative performance, other methods have minimal or even negative impact. We also analyze key considerations when implementing data augmentation methods in production.

Comments:	8 pages, 3 tables. Accepted to NeurIPs 2021 Data-centric AI Workshop
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2206.05790 [cs.CL]
	(or arXiv:2206.05790v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2206.05790

Submission history

From: Derek Chen [view email]
[v1] Sun, 12 Jun 2022 16:56:31 UTC (35 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2022-06

Change to browse by:

References & Citations

export BibTeX citation

Computer Science > Computation and Language

Title:Data Augmentation for Intent Classification

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Data Augmentation for Intent Classification

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators