Embedding Words as Distributions with a Bayesian Skip-gram Model

Bražinskas, Arthur; Havrylov, Serhii; Titov, Ivan

Computer Science > Computation and Language

arXiv:1711.11027 (cs)

[Submitted on 29 Nov 2017 (v1), last revised 10 Jun 2018 (this version, v2)]

Title:Embedding Words as Distributions with a Bayesian Skip-gram Model

Authors:Arthur Bražinskas, Serhii Havrylov, Ivan Titov

View PDF

Abstract:We introduce a method for embedding words as probability densities in a low-dimensional space. Rather than assuming that a word embedding is fixed across the entire text collection, as in standard word embedding methods, in our Bayesian model we generate it from a word-specific prior density for each occurrence of a given word. Intuitively, for each word, the prior density encodes the distribution of its potential 'meanings'. These prior densities are conceptually similar to Gaussian embeddings. Interestingly, unlike the Gaussian embeddings, we can also obtain context-specific densities: they encode uncertainty about the sense of a word given its context and correspond to posterior distributions within our model. The context-dependent densities have many potential applications: for example, we show that they can be directly used in the lexical substitution task. We describe an effective estimation method based on the variational autoencoding framework. We also demonstrate that our embeddings achieve competitive results on standard benchmarks.

Comments:	COLING 2018. For the associated code, see this https URL
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:1711.11027 [cs.CL]
	(or arXiv:1711.11027v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1711.11027

Submission history

From: Serhii Havrylov [view email]
[v1] Wed, 29 Nov 2017 18:55:48 UTC (2,025 KB)
[v2] Sun, 10 Jun 2018 15:44:44 UTC (2,546 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2017-11

Change to browse by:

cs
cs.AI
cs.LG

References & Citations

DBLP - CS Bibliography

listing | bibtex

Arthur Brazinskas
Serhii Havrylov
Ivan Titov

export BibTeX citation

Computer Science > Computation and Language

Title:Embedding Words as Distributions with a Bayesian Skip-gram Model

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Embedding Words as Distributions with a Bayesian Skip-gram Model

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators