How Do Your Biomedical Named Entity Models Generalize to Novel Entities?

Kim, Hyunjae; Kang, Jaewoo

Computer Science > Computation and Language

arXiv:2101.00160v2 (cs)

COVID-19 e-print

Important: e-prints posted on arXiv are not peer-reviewed by arXiv; they should not be relied upon without context to guide clinical practice or health-related behavior and should not be reported in news media as established information without consulting multiple experts in the field.

[Submitted on 1 Jan 2021 (v1), revised 18 Aug 2021 (this version, v2), latest version 14 Mar 2022 (v3)]

Title:How Do Your Biomedical Named Entity Models Generalize to Novel Entities?

Authors:Hyunjae Kim, Jaewoo Kang

View PDF

Abstract:The number of biomedical literature on new biomedical concepts is rapidly increasing, which necessitates a reliable biomedical named entity recognition (BioNER) model for identifying new and unseen entity mentions. However, it is questionable whether existing BioNER models can effectively handle them. In this work, we systematically analyze the three types of recognition abilities of BioNER models: memorization, synonym generalization, and concept generalization. We find that although BioNER models achieve state-of-the-art performance on BioNER benchmarks based on overall performance, they have limitations in identifying synonyms and new biomedical concepts such as COVID-19. From this observation, we conclude that existing BioNER models are overestimated in terms of their generalization abilities. Also, we identify several difficulties in recognizing unseen mentions in BioNER and make the following conclusions: (1) BioNER models tend to exploit dataset biases, which hinders the models' abilities to generalize, and (2) several biomedical names have novel morphological patterns with little name regularity such as COVID-19, and models fail to recognize them. We apply a current statistics-based debiasing method to our problem as a simple remedy and show the improvement in generalization to unseen mentions. We hope that our analyses and findings would be able to facilitate further research into the generalization capabilities of NER models in a domain where their reliability is of utmost importance.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2101.00160 [cs.CL]
	(or arXiv:2101.00160v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2101.00160

Submission history

From: Hyunjae Kim [view email]
[v1] Fri, 1 Jan 2021 04:13:42 UTC (49 KB)
[v2] Wed, 18 Aug 2021 06:49:06 UTC (62 KB)
[v3] Mon, 14 Mar 2022 07:50:41 UTC (124 KB)

Computer Science > Computation and Language

Title:How Do Your Biomedical Named Entity Models Generalize to Novel Entities?

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:How Do Your Biomedical Named Entity Models Generalize to Novel Entities?

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators