AraTrust: An Evaluation of Trustworthiness for LLMs in Arabic

Alghamdi, Emad A.; Masoud, Reem I.; Alnuhait, Deema; Alomairi, Afnan Y.; Ashraf, Ahmed; Zaytoon, Mohamed

arXiv:2403.09017 (cs)

[Submitted on 14 Mar 2024 (v1), last revised 5 Nov 2024 (this version, v3)]

Abstract: The rapid progress and widespread adoption of artificial intelligence (AI) systems highlight a pressing need to understand both the capabilities and the potential risks of AI. Given the linguistic complexity, cultural richness, and underrepresented status of Arabic in AI research, there is a clear need to focus on the performance and safety of Large Language Models (LLMs) on Arabic-related tasks. Despite some progress in the development of such models, comprehensive trustworthiness evaluation benchmarks are lacking, which makes it difficult to accurately assess and improve the safety of LLMs when prompted in Arabic. In this paper, we introduce AraTrust, the first comprehensive trustworthiness benchmark for LLMs in Arabic. AraTrust comprises 522 human-written multiple-choice questions addressing diverse dimensions related to truthfulness, ethics, safety, physical health, mental health, unfairness, illegal activities, privacy, and offensive language. We evaluated a set of LLMs against our benchmark to assess their trustworthiness. GPT-4 was the most trustworthy LLM, while open-source models, particularly AceGPT 7B and Jais 13B, struggled to achieve a score of 60% on our benchmark.
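As a rough illustration of how a multiple-choice benchmark of this kind might be scored, the sketch below computes overall and per-category accuracy. It assumes the reported score is plain accuracy, which the abstract does not state. The function name `accuracy_by_category`, the record keys `category` and `answer`, and the toy data are all hypothetical and are not taken from AraTrust or its release.

```python
from collections import defaultdict


def accuracy_by_category(items, predictions):
    """Return (overall, per_category) accuracy for multiple-choice items.

    items: list of dicts with the hypothetical keys "category" and "answer".
    predictions: list of predicted option labels, aligned with items.
    """
    if len(items) != len(predictions):
        raise ValueError("items and predictions must have the same length")

    correct = defaultdict(int)
    total = defaultdict(int)
    for item, pred in zip(items, predictions):
        total[item["category"]] += 1
        if pred == item["answer"]:
            correct[item["category"]] += 1

    per_category = {c: correct[c] / total[c] for c in total}
    overall = sum(correct.values()) / len(items) if items else 0.0
    return overall, per_category


# Toy data invented purely for illustration (not AraTrust questions or results).
toy_items = [
    {"category": "privacy", "answer": "A"},
    {"category": "privacy", "answer": "C"},
    {"category": "ethics", "answer": "B"},
    {"category": "ethics", "answer": "B"},
]
toy_predictions = ["A", "B", "B", "B"]

overall, per_category = accuracy_by_category(toy_items, toy_predictions)
print(f"overall accuracy: {overall:.2f}")
for category, acc in sorted(per_category.items()):
    print(f"{category}: {acc:.2f}")
```

Reporting the per-category breakdown alongside the overall figure makes it visible when a model does well on one dimension, such as ethics, while lagging on another, such as privacy.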

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2403.09017 [cs.CL]
	(or arXiv:2403.09017v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2403.09017

Submission history

From: Deema Alnuhait
[v1] Thu, 14 Mar 2024 00:45:24 UTC (5,635 KB)
[v2] Fri, 15 Mar 2024 23:52:18 UTC (5,635 KB)
[v3] Tue, 5 Nov 2024 02:19:26 UTC (5,636 KB)
