Document-Level In-Context Few-Shot Relation Extraction via Pre-Trained Language Models

Ozyurt, Yilmazcan; Feuerriegel, Stefan; Zhang, Ce

Computer Science > Computation and Language

arXiv:2310.11085 (cs)

[Submitted on 17 Oct 2023 (v1), last revised 2 Oct 2024 (this version, v4)]

Title:Document-Level In-Context Few-Shot Relation Extraction via Pre-Trained Language Models

Authors:Yilmazcan Ozyurt, Stefan Feuerriegel, Ce Zhang

View PDF HTML (experimental)

Abstract:Document-level relation extraction aims at inferring structured human knowledge from textual documents. State-of-the-art methods for this task use pre-trained language models (LMs) via fine-tuning, yet fine-tuning is computationally expensive and cannot adapt to new relation types or new LMs. As a remedy, we leverage the generalization capabilities of pre-trained LMs and present a novel framework for document-level in-context few-shot relation extraction. Our framework has three strengths: it eliminates the need (1) for named entity recognition and (2) for human annotations of documents, and (3) it can be updated to new LMs without re-training. We evaluate our framework using DocRED, the largest publicly available dataset for document-level relation extraction, and demonstrate that our framework achieves state-of-the-art performance. We further show that our framework actually performs much better than the original labels from the development set of DocRED. Finally, we conduct an extensive benchmark demonstrating the effectiveness of our framework, achieving state-of-the-art results across six relation extraction datasets and outperforming more than 30 baseline methods. Unlike our framework, the baseline methods have large computational overhead (e.g., from fine-tuning). To the best of our knowledge, we are the first to reformulate the document-level relation extraction task as a tailored in-context few-shot learning paradigm.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2310.11085 [cs.CL]
	(or arXiv:2310.11085v4 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2310.11085

Submission history

From: Yilmazcan Ozyurt [view email]
[v1] Tue, 17 Oct 2023 09:10:27 UTC (221 KB)
[v2] Fri, 2 Feb 2024 13:50:42 UTC (277 KB)
[v3] Thu, 23 May 2024 08:33:51 UTC (291 KB)
[v4] Wed, 2 Oct 2024 11:35:45 UTC (319 KB)

Computer Science > Computation and Language

Title:Document-Level In-Context Few-Shot Relation Extraction via Pre-Trained Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Document-Level In-Context Few-Shot Relation Extraction via Pre-Trained Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators