Modelling General Properties of Nouns by Selectively Averaging Contextualised Embeddings

Li, Na; Bouraoui, Zied; Camacho-Collados, Jose; Espinosa-Anke, Luis; Gu, Qing; Schockaert, Steven

arXiv:2012.07580 (cs)

[Submitted on 4 Dec 2020 (v1), last revised 17 May 2021 (this version, v2)]

Abstract: While the success of pre-trained language models has largely eliminated the need for high-quality static word vectors in many NLP applications, such vectors continue to play an important role in tasks where words need to be modelled in the absence of linguistic context. In this paper, we explore how the contextualised embeddings predicted by BERT can be used to produce high-quality word vectors for such domains, in particular related to knowledge base completion, where our focus is on capturing the semantic properties of nouns. We find that a simple strategy of averaging the contextualised embeddings of masked word mentions leads to vectors that outperform the static word vectors learned by BERT, as well as those from standard word embedding models, in property induction tasks. We notice in particular that masking target words is critical to achieving this strong performance, as the resulting vectors focus less on idiosyncratic properties and more on general semantic properties. Inspired by this view, we propose a filtering strategy aimed at removing the most idiosyncratic mention vectors, allowing us to obtain further performance gains in property induction.
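The two ideas in the abstract, averaging the embeddings of masked mentions and then averaging only a filtered subset, can be illustrated with a minimal, standard-library sketch. Everything named here is an assumption for illustration: `mask_mention`, `mean_vector`, `selective_average` and `keep_fraction` are hypothetical, the two-dimensional vectors are toy stand-ins for the contextualised embeddings BERT would produce for masked mentions, and the filter (keep the mentions most similar to the unfiltered centroid) is a simple placeholder rather than the filtering strategy the paper proposes.

```python
import math
import re

MASK = "[MASK]"  # mask token used by BERT-style models


def mask_mention(sentence, target):
    """Replace whole-word occurrences of `target` with the mask token."""
    pattern = r"\b" + re.escape(target) + r"\b"
    return re.sub(pattern, MASK, sentence, flags=re.IGNORECASE)


def mean_vector(vectors):
    """Component-wise average of equal-length vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def selective_average(mention_vectors, keep_fraction=0.75):
    """Average only the mention vectors closest to the unfiltered centroid.

    Assumption: low similarity to the centroid stands in for "idiosyncratic";
    the paper's actual criterion may differ.
    """
    centroid = mean_vector(mention_vectors)
    ranked = sorted(mention_vectors,
                    key=lambda v: cosine(v, centroid),
                    reverse=True)
    keep = max(1, round(keep_fraction * len(ranked)))
    return mean_vector(ranked[:keep])


# Toy mention vectors (hypothetical); the last one plays the role of an
# idiosyncratic mention that the filter should drop.
mention_vectors = [
    [1.0, 0.0],
    [0.9, 0.1],
    [0.8, 0.2],
    [-1.0, 0.5],
]

masked = mask_mention("A banana is a yellow fruit.", "banana")
plain = mean_vector(mention_vectors)
filtered = selective_average(mention_vectors, keep_fraction=0.75)
```

In a real pipeline each sentence returned by `mask_mention` would be passed through BERT and the resulting mention vector collected; that step is omitted here so the sketch runs without any model download.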

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2012.07580 [cs.CL]
	(or arXiv:2012.07580v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2012.07580

Submission history

From: Zied Bouraoui
[v1] Fri, 4 Dec 2020 14:03:03 UTC (1,425 KB)
[v2] Mon, 17 May 2021 15:00:19 UTC (1,664 KB)
