Word-Embeddings Distinguish Denominal and Root-Derived Verbs in Semitic

Benbaji, Ido; Doron, Omri; Hénot-Mortier, Adèle

doi:10.4204/EPTCS.366.6

arXiv:2208.05721 (cs)

[Submitted on 11 Aug 2022]

Authors: Ido Benbaji (MIT), Omri Doron (MIT), Adèle Hénot-Mortier (MIT)

Abstract: Proponents of the Distributed Morphology framework have posited the existence of two levels of morphological word formation: a lower one, leading to loose input-output semantic relationships; and an upper one, leading to tight input-output semantic relationships. In this work, we propose to test the validity of this assumption in the context of Hebrew word embeddings. If the two-level hypothesis is borne out, we expect state-of-the-art Hebrew word embeddings to encode (1) a noun, (2) a denominal verb derived from it (via an upper-level operation), and (3) a verb related to the noun (via a lower-level operation on the noun's root), in such a way that the denominal verb (2) is closer in the embedding space to the noun (1) than the root-derived verb (3) is to the same noun (1). We report that this prediction is borne out by four embedding models of Hebrew: fastText, GloVe, Word2Vec and AlephBERT. This suggests that word embedding models are able to capture complex and fine-grained semantic properties that are morphologically motivated.
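
The comparison described in the abstract can be illustrated with a minimal sketch. It assumes cosine similarity as the measure of closeness, which the abstract does not specify, and it uses made-up three-dimensional vectors rather than real Hebrew embeddings. The function names and the toy triplets are hypothetical and are not taken from the paper.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def supports_two_level_hypothesis(noun, denominal, root_verb):
    """True if the denominal verb is closer to the noun than the root-derived verb is.

    Closeness is measured by cosine similarity (an assumption of this sketch).
    """
    return cosine_similarity(noun, denominal) > cosine_similarity(noun, root_verb)


def fraction_supporting(triplets):
    """Share of (noun, denominal, root_verb) triplets that fit the prediction."""
    hits = sum(supports_two_level_hypothesis(n, d, v) for n, d, v in triplets)
    return hits / len(triplets)


# Toy vectors for illustration only; real inputs would be embeddings looked up
# in a trained model for each (noun, denominal verb, root-derived verb) triplet.
toy_triplets = [
    ([1.0, 0.0, 0.0], [0.9, 0.1, 0.0], [0.5, 0.5, 0.0]),  # fits the prediction
    ([0.0, 1.0, 0.0], [0.0, 0.8, 0.2], [0.6, 0.4, 0.0]),  # fits the prediction
    ([0.0, 0.0, 1.0], [0.7, 0.0, 0.3], [0.0, 0.1, 0.9]),  # does not fit
]

print(f"{fraction_supporting(toy_triplets):.2f}")
```

With real data, each vector would be replaced by the embedding that a given model assigns to the corresponding word form, and the same fraction could be computed separately for each model.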

Comments:	In Proceedings E2ECOMPVEC, arXiv:2208.05313
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2208.05721 [cs.CL]
	(or arXiv:2208.05721v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2208.05721
Journal reference:	EPTCS 366, 2022, pp. 35-49
Related DOI:	https://doi.org/10.4204/EPTCS.366.6

Submission history

From: EPTCS [view email] [via EPTCS proxy]
[v1] Thu, 11 Aug 2022 09:31:37 UTC (119 KB)
