Beyond Profile: From Surface-Level Facts to Deep Persona Simulation in LLMs

Wang, Zixiao; Zhang, Duzhen; Agrawal, Ishita; Gao, Shen; Song, Le; Chen, Xiuying

Abstract:Previous approaches to persona simulation large language models (LLMs) have typically relied on learning basic biographical information, or using limited role-play dialogue datasets to capture a character's responses. However, a holistic representation of an individual goes beyond surface-level facts or conversations to deeper thoughts and thinking. In this work, we introduce CharacterBot, a model designed to replicate both the linguistic patterns and distinctive thought processes of a character. Using Lu Xun, a renowned Chinese writer, as a case study, we propose four training tasks derived from his 17 essay collections. These include a pre-training task focused on mastering external linguistic structures and knowledge, as well as three fine-tuning tasks: multiple-choice question answering, generative question answering, and style transfer, each aligning the LLM with Lu Xun's internal ideation and writing style. To optimize learning across these tasks, we introduce a CharLoRA parameter updating mechanism, where a general linguistic style expert collaborates with other task-specific experts to better study both the language style and the understanding of deeper thoughts. We evaluate CharacterBot on three tasks for linguistic accuracy and opinion comprehension, demonstrating that it significantly outperforms the baselines on our adapted metrics. We hope that this work inspires future research on deep character persona simulation LLM.

Comments:	19 pages, 3 figures
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2502.12988 [cs.CL]
	(or arXiv:2502.12988v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2502.12988

Computer Science > Computation and Language

Title:Beyond Profile: From Surface-Level Facts to Deep Persona Simulation in LLMs

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators