HyperLex: A Large-Scale Evaluation of Graded Lexical Entailment

Vulić, Ivan; Gerz, Daniela; Kiela, Douwe; Hill, Felix; Korhonen, Anna

Computer Science > Computation and Language

arXiv:1608.02117v1 (cs)

[Submitted on 6 Aug 2016 (this version), latest version 10 May 2017 (v2)]

Title:HyperLex: A Large-Scale Evaluation of Graded Lexical Entailment

Authors:Ivan Vulić, Daniela Gerz, Douwe Kiela, Felix Hill, Anna Korhonen

View PDF

Abstract:We introduce HyperLex - a dataset and evaluation resource that quantifies the extent of of the semantic category membership and lexical entailment (LE) relation between 2,616 concept pairs. Cognitive psychology research has established that category/class membership, and hence LE, is computed in human semantic memory as a gradual rather than binary relation. Nevertheless, most NLP research, and existing large-scale invetories of concept category membership (WordNet, DBPedia, etc.) treat category membership and LE as binary. To address this, we asked hundreds of native English speakers to indicate strength of category membership between a diverse range of concept pairs on a crowdsourcing platform. Our results confirm that category membership and LE are indeed more gradual than binary. We then compare these human judgements with the predictions of automatic systems, which reveals a huge gap between human performance and state-of-the-art LE, distributional and representation learning models, and substantial differences between the models themselves. We discuss a pathway for improving semantic models to overcome this discrepancy, and indicate future application areas for improved graded LE systems.

Comments:	arXiv admin note: text overlap with arXiv:1511.06361, arXiv:1412.6623 by other authors
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1608.02117 [cs.CL]
	(or arXiv:1608.02117v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1608.02117

Submission history

From: Ivan Vulić [view email]
[v1] Sat, 6 Aug 2016 15:29:34 UTC (1,227 KB)
[v2] Wed, 10 May 2017 15:07:53 UTC (1,866 KB)

Computer Science > Computation and Language

Title:HyperLex: A Large-Scale Evaluation of Graded Lexical Entailment

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:HyperLex: A Large-Scale Evaluation of Graded Lexical Entailment

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators