Improving Sparse Word Representations with Distributional Inference for Semantic Composition

Kober, Thomas; Weeds, Julie; Reffin, Jeremy; Weir, David

Computer Science > Computation and Language

arXiv:1608.06794 (cs)

[Submitted on 24 Aug 2016]

Title:Improving Sparse Word Representations with Distributional Inference for Semantic Composition

Authors:Thomas Kober, Julie Weeds, Jeremy Reffin, David Weir

View PDF

Abstract:Distributional models are derived from co-occurrences in a corpus, where only a small proportion of all possible plausible co-occurrences will be observed. This results in a very sparse vector space, requiring a mechanism for inferring missing knowledge. Most methods face this challenge in ways that render the resulting word representations uninterpretable, with the consequence that semantic composition becomes hard to model. In this paper we explore an alternative which involves explicitly inferring unobserved co-occurrences using the distributional neighbourhood. We show that distributional inference improves sparse word representations on several word similarity benchmarks and demonstrate that our model is competitive with the state-of-the-art for adjective-noun, noun-noun and verb-object compositions while being fully interpretable.

Comments:	To appear at EMNLP 2016
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1608.06794 [cs.CL]
	(or arXiv:1608.06794v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1608.06794

Submission history

From: Thomas Kober [view email]
[v1] Wed, 24 Aug 2016 12:38:45 UTC (53 KB)

Full-text links:

Access Paper:

view license

Current browse context:

< prev | next >

new | recent | 2016-08

Change to browse by:

cs.CL

References & Citations

DBLP - CS Bibliography

listing | bibtex

Thomas Kober
Julie Weeds
Jeremy Reffin
David J. Weir

export BibTeX citation

Computer Science > Computation and Language

Title:Improving Sparse Word Representations with Distributional Inference for Semantic Composition

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Improving Sparse Word Representations with Distributional Inference for Semantic Composition

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators