Linking Surface Facts to Large-Scale Knowledge Graphs

Radevski, Gorjan; Gashteovski, Kiril; Hung, Chia-Chien; Lawrence, Carolin; Glavaš, Goran

arXiv:2310.14909 (cs)

[Submitted on 23 Oct 2023]

Abstract: Open Information Extraction (OIE) methods extract facts from natural language text in the form of ("subject"; "relation"; "object") triples. These facts are, however, merely surface forms, the ambiguity of which impedes their downstream usage; e.g., the surface phrase "Michael Jordan" may refer to either the former basketball player or the university professor. Knowledge Graphs (KGs), on the other hand, contain facts in a canonical (i.e., unambiguous) form, but their coverage is limited by a static schema (i.e., a fixed set of entities and predicates). To bridge this gap, we need the best of both worlds: (i) high coverage of free-text OIEs, and (ii) semantic precision (i.e., monosemy) of KGs. In order to achieve this goal, we propose a new benchmark with novel evaluation protocols that can, for example, measure fact linking performance on a granular triple slot level, while also measuring if a system has the ability to recognize that a surface form has no match in the existing KG. Our extensive evaluation of several baselines shows that detection of out-of-KG entities and predicates is more difficult than accurate linking to existing ones, thus calling for more research efforts on this difficult task. We publicly release all resources (data, benchmark and code) on this https URL.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2310.14909 [cs.CL]
	(or arXiv:2310.14909v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2310.14909

Submission history

From: Kiril Gashteovski
[v1] Mon, 23 Oct 2023 13:18:49 UTC (8,871 KB)
