Sentiment Analysis for Sinhala Language using Deep Learning Techniques

Senevirathne, Lahiru; Demotte, Piyumal; Karunanayake, Binod; Munasinghe, Udyogi; Ranathunga, Surangika

Computer Science > Computation and Language

arXiv:2011.07280 (cs)

[Submitted on 14 Nov 2020]

Title:Sentiment Analysis for Sinhala Language using Deep Learning Techniques

Authors:Lahiru Senevirathne, Piyumal Demotte, Binod Karunanayake, Udyogi Munasinghe, Surangika Ranathunga

View PDF

Abstract:Due to the high impact of the fast-evolving fields of machine learning and deep learning, Natural Language Processing (NLP) tasks have further obtained comprehensive performances for highly resourced languages such as English and Chinese. However Sinhala, which is an under-resourced language with a rich morphology, has not experienced these advancements. For sentiment analysis, there exists only two previous research with deep learning approaches, which focused only on document-level sentiment analysis for the binary case. They experimented with only three types of deep learning models. In contrast, this paper presents a much comprehensive study on the use of standard sequence models such as RNN, LSTM, Bi-LSTM, as well as more recent state-of-the-art models such as hierarchical attention hybrid neural networks, and capsule networks. Classification is done at document-level but with more granularity by considering POSITIVE, NEGATIVE, NEUTRAL, and CONFLICT classes. A data set of 15059 Sinhala news comments, annotated with these four classes and a corpus consists of 9.48 million tokens are publicly released. This is the largest sentiment annotated data set for Sinhala so far.

Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
ACM classes:	I.2.6; I.2.7
Cite as:	arXiv:2011.07280 [cs.CL]
	(or arXiv:2011.07280v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2011.07280

Submission history

From: Piyumal Demotte [view email]
[v1] Sat, 14 Nov 2020 12:02:30 UTC (1,443 KB)

Computer Science > Computation and Language

Title:Sentiment Analysis for Sinhala Language using Deep Learning Techniques

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Sentiment Analysis for Sinhala Language using Deep Learning Techniques

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators