Multi-agent KTO: Reinforcing Strategic Interactions of Large Language Model in Language Game

Ye, Rong; Zhang, Yongxin; Zhang, Yikai; Kuang, Haoyu; Wei, Zhongyu; Sun, Peng

Computer Science > Computation and Language

arXiv:2501.14225 (cs)

[Submitted on 24 Jan 2025]

Title:Multi-agent KTO: Reinforcing Strategic Interactions of Large Language Model in Language Game

Authors:Rong Ye, Yongxin Zhang, Yikai Zhang, Haoyu Kuang, Zhongyu Wei, Peng Sun

View PDF HTML (experimental)

Abstract:Achieving Artificial General Intelligence (AGI) requires AI agents that can not only make stratigic decisions but also engage in flexible and meaningful communication. Inspired by Wittgenstein's language game theory in Philosophical Investigations, we propose that language agents can learn through in-context interaction rather than traditional multi-stage frameworks that separate decision-making from language expression. Using Werewolf, a social deduction game that tests language understanding, strategic interaction, and adaptability, we develop the Multi-agent Kahneman & Tversky's Optimization (MaKTO). MaKTO engages diverse models in extensive gameplay to generate unpaired desirable and unacceptable responses, then employs KTO to refine the model's decision-making process. In 9-player Werewolf games, MaKTO achieves a 61% average win rate across various models, outperforming GPT-4o and two-stage RL agents by relative improvements of 23.0% and 10.9%, respectively. Notably, MaKTO also demonstrates human-like performance, winning 60% against expert players and showing only 49% detectability in Turing-style blind tests. These results showcase MaKTO's superior decision-making, strategic adaptation, and natural language generation in complex social deduction games.

Comments:	Preprint. Code and data will be available at this https URL
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
Cite as:	arXiv:2501.14225 [cs.CL]
	(or arXiv:2501.14225v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2501.14225

Submission history

From: Rong Ye [view email]
[v1] Fri, 24 Jan 2025 04:09:03 UTC (5,039 KB)

Computer Science > Computation and Language

Title:Multi-agent KTO: Reinforcing Strategic Interactions of Large Language Model in Language Game

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Multi-agent KTO: Reinforcing Strategic Interactions of Large Language Model in Language Game

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators