SpanProto: A Two-stage Span-based Prototypical Network for Few-shot Named Entity Recognition

Wang, Jianing; Han, Chengcheng; Wang, Chengyu; Tan, Chuanqi; Qiu, Minghui; Huang, Songfang; Huang, Jun; Gao, Ming

arXiv:2210.09049 (cs)

[Submitted on 17 Oct 2022 (v1), last revised 21 Nov 2022 (this version, v2)]

Abstract: Few-shot Named Entity Recognition (NER) aims to identify named entities with very little annotated data. Previous methods solve this problem with token-wise classification, which ignores entity boundary information, and their performance inevitably suffers from the large number of non-entity tokens. To address this, we propose a span-based prototypical network (SpanProto) that tackles few-shot NER with a two-stage approach consisting of span extraction and mention classification. In the span extraction stage, we transform the sequential tags into a global boundary matrix, enabling the model to focus on explicit boundary information. For mention classification, we leverage prototypical learning to capture the semantic representation of each labeled span and help the model better adapt to novel-class entities. To further improve performance, we separate out the false positives that are generated by the span extractor but not labeled in the current episode set, and introduce a margin-based loss to push them away from each prototype region. Experiments over multiple benchmarks demonstrate that our model outperforms strong baselines by a large margin.
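
The two stages described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the BIO tag scheme, the function names (`tags_to_boundary_matrix`, `build_prototypes`, `classify_span`), the use of Euclidean distance, and the plain-list embeddings are all assumptions made for illustration. In particular, the `margin` threshold applied at classification time is only a stand-in for the margin-based loss, whose exact form the abstract does not give.

```python
from math import sqrt


def tags_to_boundary_matrix(tags):
    """Turn BIO tags into an n x n 0/1 matrix (assumed encoding).

    matrix[i][j] == 1 means tokens i..j (inclusive) form one entity span.
    """
    n = len(tags)
    matrix = [[0] * n for _ in range(n)]
    start = None
    # A trailing "O" sentinel closes a span that runs to the last token.
    for i, tag in enumerate(list(tags) + ["O"]):
        if start is not None and not tag.startswith("I-"):
            matrix[start][i - 1] = 1
            start = None
        if tag.startswith("B-"):
            start = i
    return matrix


def build_prototypes(support):
    """Average the span embeddings of each label into one prototype.

    support: dict mapping label -> list of equal-length embedding vectors.
    """
    prototypes = {}
    for label, vectors in support.items():
        dim = len(vectors[0])
        prototypes[label] = [
            sum(v[d] for v in vectors) / len(vectors) for d in range(dim)
        ]
    return prototypes


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def classify_span(embedding, prototypes, margin):
    """Return the nearest prototype's label, or None for a likely false positive.

    Assumption: a span farther than `margin` from every prototype is rejected.
    """
    label, dist = min(
        ((lab, euclidean(embedding, proto)) for lab, proto in prototypes.items()),
        key=lambda pair: pair[1],
    )
    return label if dist <= margin else None


if __name__ == "__main__":
    # Toy data, invented for this example only.
    print(tags_to_boundary_matrix(["B-PER", "I-PER", "O", "B-LOC"]))
    protos = build_prototypes({"PER": [[0.0, 0.0], [2.0, 0.0]], "LOC": [[10.0, 10.0]]})
    print(classify_span([1.0, 0.5], protos, margin=2.0))
```

The matrix form makes each entity a single cell indexed by its start and end token, so the span extractor can score boundaries directly instead of labeling every token.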

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2210.09049 [cs.CL]
	(or arXiv:2210.09049v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2210.09049

Submission history

From: Jianing Wang
[v1] Mon, 17 Oct 2022 12:59:33 UTC (821 KB)
[v2] Mon, 21 Nov 2022 07:20:55 UTC (1,107 KB)
