CONTaiNER: Few-Shot Named Entity Recognition via Contrastive Learning

Das, Sarkar Snigdha Sarathi; Katiyar, Arzoo; Passonneau, Rebecca J.; Zhang, Rui

Computer Science > Computation and Language

arXiv:2109.07589 (cs)

[Submitted on 15 Sep 2021 (v1), last revised 28 Mar 2022 (this version, v2)]

Title:CONTaiNER: Few-Shot Named Entity Recognition via Contrastive Learning

Authors:Sarkar Snigdha Sarathi Das, Arzoo Katiyar, Rebecca J. Passonneau, Rui Zhang

View PDF

Abstract:Named Entity Recognition (NER) in Few-Shot setting is imperative for entity tagging in low resource domains. Existing approaches only learn class-specific semantic features and intermediate representations from source domains. This affects generalizability to unseen target domains, resulting in suboptimal performances. To this end, we present CONTaiNER, a novel contrastive learning technique that optimizes the inter-token distribution distance for Few-Shot NER. Instead of optimizing class-specific attributes, CONTaiNER optimizes a generalized objective of differentiating between token categories based on their Gaussian-distributed embeddings. This effectively alleviates overfitting issues originating from training domains. Our experiments in several traditional test domains (OntoNotes, CoNLL'03, WNUT '17, GUM) and a new large scale Few-Shot NER dataset (Few-NERD) demonstrate that on average, CONTaiNER outperforms previous methods by 3%-13% absolute F1 points while showing consistent performance trends, even in challenging scenarios where previous approaches could not achieve appreciable performance.

Comments:	Accepted by ACL 2022 (Main Conference, Long Paper)
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2109.07589 [cs.CL]
	(or arXiv:2109.07589v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2109.07589

Submission history

From: Sarkar Snigdha Sarathi Das [view email]
[v1] Wed, 15 Sep 2021 21:41:16 UTC (245 KB)
[v2] Mon, 28 Mar 2022 06:47:40 UTC (603 KB)

Computer Science > Computation and Language

Title:CONTaiNER: Few-Shot Named Entity Recognition via Contrastive Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:CONTaiNER: Few-Shot Named Entity Recognition via Contrastive Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators