Transformer-Based Contextualized Language Models Joint with Neural Networks for Natural Language Inference in Vietnamese

Nguyen, Dat Van-Thanh; Van Huynh, Tin; Van Nguyen, Kiet; Nguyen, Ngan Luu-Thuy

Computer Science > Computation and Language

arXiv:2411.13407 (cs)

[Submitted on 20 Nov 2024 (v1), last revised 21 Nov 2024 (this version, v2)]

Title:Transformer-Based Contextualized Language Models Joint with Neural Networks for Natural Language Inference in Vietnamese

Authors:Dat Van-Thanh Nguyen, Tin Van Huynh, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen

View PDF HTML (experimental)

Abstract:Natural Language Inference (NLI) is a task within Natural Language Processing (NLP) that holds value for various AI applications. However, there have been limited studies on Natural Language Inference in Vietnamese that explore the concept of joint models. Therefore, we conducted experiments using various combinations of contextualized language models (CLM) and neural networks. We use CLM to create contextualized work presentations and use Neural Networks for classification. Furthermore, we have evaluated the strengths and weaknesses of each joint model and identified the model failure points in the Vietnamese context. The highest F1 score in this experiment, up to 82.78% in the benchmark dataset (ViNLI). By conducting experiments with various models, the most considerable size of the CLM is XLM-R (355M). That combination has consistently demonstrated superior performance compared to fine-tuning strong pre-trained language models like PhoBERT (+6.58%), mBERT (+19.08%), and XLM-R (+0.94%) in terms of F1-score. This article aims to introduce a novel approach or model that attains improved performance for Vietnamese NLI. Overall, we find that the joint approach of CLM and neural networks is simple yet capable of achieving high-quality performance, which makes it suitable for applications that require efficient resource utilization.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2411.13407 [cs.CL]
	(or arXiv:2411.13407v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2411.13407

Submission history

From: Dat Nguyen Van-Thanh [view email]
[v1] Wed, 20 Nov 2024 15:46:48 UTC (1,233 KB)
[v2] Thu, 21 Nov 2024 02:27:38 UTC (1,233 KB)

Computer Science > Computation and Language

Title:Transformer-Based Contextualized Language Models Joint with Neural Networks for Natural Language Inference in Vietnamese

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Transformer-Based Contextualized Language Models Joint with Neural Networks for Natural Language Inference in Vietnamese

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators