Beyond Believability: Accurate Human Behavior Simulation with Fine-Tuned LLMs

Lu, Yuxuan; Huang, Jing; Han, Yan; Bei, Bennet; Xie, Yaochen; Wang, Dakuo; Wang, Jessie; He, Qi

arXiv:2503.20749 (cs)

[Submitted on 26 Mar 2025 (v1), last revised 27 Mar 2025 (this version, v2)]

Abstract: Recent research shows that LLMs can simulate "believable" human behaviors to power LLM agents via prompt-only methods. In this work, we focus on evaluating and improving LLMs' objective "accuracy" rather than their subjective "believability" in the web action generation task, leveraging a large-scale, real-world dataset of human actions collected from online shopping. We present the first comprehensive quantitative evaluation of state-of-the-art LLMs (e.g., DeepSeek-R1, Llama, and Claude) on the task of web action generation. Our results show that fine-tuning LLMs on real-world behavioral data substantially improves their ability to generate actions compared to prompt-only methods. Furthermore, incorporating synthesized reasoning traces into model training leads to additional performance gains, demonstrating the value of explicit rationale in behavior modeling. This work establishes a new benchmark for evaluating LLMs in behavior simulation and offers actionable insights into how real-world action data and reasoning augmentation can enhance the fidelity of LLM agents.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2503.20749 [cs.CL]
	(or arXiv:2503.20749v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2503.20749

Submission history

From: Yuxuan Lu
[v1] Wed, 26 Mar 2025 17:33:27 UTC (253 KB)
[v2] Thu, 27 Mar 2025 02:42:03 UTC (468 KB)
