A Cloud-based Machine Learning Pipeline for the Efficient Extraction of Insights from Customer Reviews

Lakatos, Robert; Bogacsovics, Gergo; Harangi, Balazs; Lakatos, Istvan; Tiba, Attila; Toth, Janos; Szabo, Marianna; Hajdu, Andras

doi:10.3390/bdcc8030020

Computer Science > Computation and Language

arXiv:2306.07786 (cs)

[Submitted on 13 Jun 2023 (v1), last revised 18 Jun 2023 (this version, v2)]

Title:A Cloud-based Machine Learning Pipeline for the Efficient Extraction of Insights from Customer Reviews

Authors:Robert Lakatos, Gergo Bogacsovics, Balazs Harangi, Istvan Lakatos, Attila Tiba, Janos Toth, Marianna Szabo, Andras Hajdu

View PDF

Abstract:The efficiency of natural language processing has improved dramatically with the advent of machine learning models, particularly neural network-based solutions. However, some tasks are still challenging, especially when considering specific domains. In this paper, we present a cloud-based system that can extract insights from customer reviews using machine learning methods integrated into a pipeline. For topic modeling, our composite model uses transformer-based neural networks designed for natural language processing, vector embedding-based keyword extraction, and clustering. The elements of our model have been integrated and further developed to meet better the requirements of efficient information extraction, topic modeling of the extracted information, and user needs. Furthermore, our system can achieve better results than this task's existing topic modeling and keyword extraction solutions. Our approach is validated and compared with other state-of-the-art methods using publicly available datasets for benchmarking.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2306.07786 [cs.CL]
	(or arXiv:2306.07786v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2306.07786
Related DOI:	https://doi.org/10.3390/bdcc8030020

Submission history

From: Róbert Lakatos [view email]
[v1] Tue, 13 Jun 2023 14:07:52 UTC (1,134 KB)
[v2] Sun, 18 Jun 2023 10:56:14 UTC (1,133 KB)

Computer Science > Computation and Language

Title:A Cloud-based Machine Learning Pipeline for the Efficient Extraction of Insights from Customer Reviews

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:A Cloud-based Machine Learning Pipeline for the Efficient Extraction of Insights from Customer Reviews

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators