Steering Prosocial AI Agents: Computational Basis of LLM's Decision Making in Social Simulation

Ma, Ji

Computer Science > Artificial Intelligence

arXiv:2504.11671 (cs)

[Submitted on 16 Apr 2025]

Title:Steering Prosocial AI Agents: Computational Basis of LLM's Decision Making in Social Simulation

Authors:Ji Ma

View PDF HTML (experimental)

Abstract:Large language models (LLMs) increasingly serve as human-like decision-making agents in social science and applied settings. These LLM-agents are typically assigned human-like characters and placed in real-life contexts. However, how these characters and contexts shape an LLM's behavior remains underexplored. This study proposes and tests methods for probing, quantifying, and modifying an LLM's internal representations in a Dictator Game -- a classic behavioral experiment on fairness and prosocial behavior. We extract ``vectors of variable variations'' (e.g., ``male'' to ``female'') from the LLM's internal state. Manipulating these vectors during the model's inference can substantially alter how those variables relate to the model's decision-making. This approach offers a principled way to study and regulate how social concepts can be encoded and engineered within transformer-based models, with implications for alignment, debiasing, and designing AI agents for social simulations in both academic and commercial applications.

Subjects:	Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Machine Learning (cs.LG); General Economics (econ.GN)
Cite as:	arXiv:2504.11671 [cs.AI]
	(or arXiv:2504.11671v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2504.11671

Submission history

From: Ji Ma [view email]
[v1] Wed, 16 Apr 2025 00:02:28 UTC (510 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.AI

< prev | next >

new | recent | 2025-04

Change to browse by:

cs
cs.CY
cs.LG
econ
econ.GN
q-fin
q-fin.EC

References & Citations

export BibTeX citation

Computer Science > Artificial Intelligence

Title:Steering Prosocial AI Agents: Computational Basis of LLM's Decision Making in Social Simulation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Steering Prosocial AI Agents: Computational Basis of LLM's Decision Making in Social Simulation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators