LLM-GLOBE: A Benchmark Evaluating the Cultural Values Embedded in LLM Output

Karinshak, Elise; Hu, Amanda; Kong, Kewen; Rao, Vishwanatha; Wang, Jingren; Wang, Jindong; Zeng, Yi

Computer Science > Computation and Language

arXiv:2411.06032 (cs)

[Submitted on 9 Nov 2024]

Title:LLM-GLOBE: A Benchmark Evaluating the Cultural Values Embedded in LLM Output

Authors:Elise Karinshak, Amanda Hu, Kewen Kong, Vishwanatha Rao, Jingren Wang, Jindong Wang, Yi Zeng

View PDF HTML (experimental)

Abstract:Immense effort has been dedicated to minimizing the presence of harmful or biased generative content and better aligning AI output to human intention; however, research investigating the cultural values of LLMs is still in very early stages. Cultural values underpin how societies operate, providing profound insights into the norms, priorities, and decision making of their members. In recognition of this need for further research, we draw upon cultural psychology theory and the empirically-validated GLOBE framework to propose the LLM-GLOBE benchmark for evaluating the cultural value systems of LLMs, and we then leverage the benchmark to compare the values of Chinese and US LLMs. Our methodology includes a novel "LLMs-as-a-Jury" pipeline which automates the evaluation of open-ended content to enable large-scale analysis at a conceptual level. Results clarify similarities and differences that exist between Eastern and Western cultural value systems and suggest that open-generation tasks represent a more promising direction for evaluation of cultural values. We interpret the implications of this research for subsequent model development, evaluation, and deployment efforts as they relate to LLMs, AI cultural alignment more broadly, and the influence of AI cultural value systems on human-AI collaboration outcomes.

Subjects:	Computation and Language (cs.CL)
ACM classes:	I.2.7
Cite as:	arXiv:2411.06032 [cs.CL]
	(or arXiv:2411.06032v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2411.06032

Submission history

From: Vishwanatha Rao [view email]
[v1] Sat, 9 Nov 2024 01:38:55 UTC (10,003 KB)

Computer Science > Computation and Language

Title:LLM-GLOBE: A Benchmark Evaluating the Cultural Values Embedded in LLM Output

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:LLM-GLOBE: A Benchmark Evaluating the Cultural Values Embedded in LLM Output

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators