Augmenting semantic lexicons using word embeddings and transfer learning

Alshaabi, Thayer; Van Oort, Colin M.; Fudolig, Mikaela Irene; Arnold, Michael V.; Danforth, Christopher M.; Dodds, Peter Sheridan

doi:10.3389/frai.2021.783778

Computer Science > Computation and Language

arXiv:2109.09010 (cs)

[Submitted on 18 Sep 2021 (v1), last revised 2 Nov 2021 (this version, v2)]

Title:Augmenting semantic lexicons using word embeddings and transfer learning

Authors:Thayer Alshaabi, Colin M. Van Oort, Mikaela Irene Fudolig, Michael V. Arnold, Christopher M. Danforth, Peter Sheridan Dodds

View PDF

Abstract:Sentiment-aware intelligent systems are essential to a wide array of applications. These systems are driven by language models which broadly fall into two paradigms: Lexicon-based and contextual. Although recent contextual models are increasingly dominant, we still see demand for lexicon-based models because of their interpretability and ease of use. For example, lexicon-based models allow researchers to readily determine which words and phrases contribute most to a change in measured sentiment. A challenge for any lexicon-based approach is that the lexicon needs to be routinely expanded with new words and expressions. Here, we propose two models for automatic lexicon expansion. Our first model establishes a baseline employing a simple and shallow neural network initialized with pre-trained word embeddings using a non-contextual approach. Our second model improves upon our baseline, featuring a deep Transformer-based network that brings to bear word definitions to estimate their lexical polarity. Our evaluation shows that both models are able to score new words with a similar accuracy to reviewers from Amazon Mechanical Turk, but at a fraction of the cost.

Comments:	17 pages, 8 figures
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG); Social and Information Networks (cs.SI); Physics and Society (physics.soc-ph)
Cite as:	arXiv:2109.09010 [cs.CL]
	(or arXiv:2109.09010v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2109.09010
Journal reference:	Front. Artif. Intell. 4:783778 (2022)
Related DOI:	https://doi.org/10.3389/frai.2021.783778

Submission history

From: Thayer Alshaabi [view email]
[v1] Sat, 18 Sep 2021 20:59:52 UTC (1,662 KB)
[v2] Tue, 2 Nov 2021 04:27:55 UTC (841 KB)

Computer Science > Computation and Language

Title:Augmenting semantic lexicons using word embeddings and transfer learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Augmenting semantic lexicons using word embeddings and transfer learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators