Are Data Augmentation Methods in Named Entity Recognition Applicable for Uncertainty Estimation?

Hashimoto, Wataru; Kamigaito, Hidetaka; Watanabe, Taro

Computer Science > Computation and Language

arXiv:2407.02062 (cs)

[Submitted on 2 Jul 2024 (v1), last revised 25 Oct 2024 (this version, v2)]

Title:Are Data Augmentation Methods in Named Entity Recognition Applicable for Uncertainty Estimation?

Authors:Wataru Hashimoto, Hidetaka Kamigaito, Taro Watanabe

View PDF HTML (experimental)

Abstract:This work investigates the impact of data augmentation on confidence calibration and uncertainty estimation in Named Entity Recognition (NER) tasks. For the future advance of NER in safety-critical fields like healthcare and finance, it is essential to achieve accurate predictions with calibrated confidence when applying Deep Neural Networks (DNNs), including Pre-trained Language Models (PLMs), as a real-world application. However, DNNs are prone to miscalibration, which limits their applicability. Moreover, existing methods for calibration and uncertainty estimation are computational expensive. Our investigation in NER found that data augmentation improves calibration and uncertainty in cross-genre and cross-lingual setting, especially in-domain setting. Furthermore, we showed that the calibration for NER tends to be more effective when the perplexity of the sentences generated by data augmentation is lower, and that increasing the size of the augmentation further improves calibration and uncertainty.

Comments:	Accepted to EMNLP 2024 main conference
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2407.02062 [cs.CL]
	(or arXiv:2407.02062v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2407.02062

Submission history

From: Wataru Hashimoto [view email]
[v1] Tue, 2 Jul 2024 08:49:43 UTC (594 KB)
[v2] Fri, 25 Oct 2024 10:07:18 UTC (596 KB)

Computer Science > Computation and Language

Title:Are Data Augmentation Methods in Named Entity Recognition Applicable for Uncertainty Estimation?

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Are Data Augmentation Methods in Named Entity Recognition Applicable for Uncertainty Estimation?

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators