LOQA: Learning with Opponent Q-Learning Awareness

Aghajohari, Milad; Duque, Juan Agustin; Cooijmans, Tim; Courville, Aaron

arXiv:2405.01035 (cs)

[Submitted on 2 May 2024]

Abstract: In various real-world scenarios, interactions among agents often resemble the dynamics of general-sum games, where each agent strives to optimize its own utility. Despite the ubiquitous relevance of such settings, decentralized machine learning algorithms have struggled to find equilibria that maximize individual utility while preserving social welfare. In this paper we introduce Learning with Opponent Q-Learning Awareness (LOQA), a novel, decentralized reinforcement learning algorithm tailored to optimizing an agent's individual utility while fostering cooperation among adversaries in partially competitive environments. LOQA assumes the opponent samples actions proportionally to their action-value function Q. Experimental results demonstrate the effectiveness of LOQA at achieving state-of-the-art performance in benchmark scenarios such as the Iterated Prisoner's Dilemma and the Coin Game. LOQA achieves these outcomes with a significantly reduced computational footprint, making it a promising approach for practical multi-agent applications.
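
The opponent model stated in the abstract (actions sampled in proportion to the action-value function Q) can be illustrated with a small sketch. This is not the authors' implementation: it assumes "proportionally" means a softmax over Q-values, and the names softmax_policy, sample_action, temperature, and the example values in opponent_q are all hypothetical, chosen only for illustration.

```python
import math
import random


def softmax_policy(q_values, temperature=1.0):
    """Map action-values to action probabilities.

    Assumption: the softmax form is an illustrative reading of
    "samples actions proportionally to Q", not taken from the abstract.
    """
    # Subtract the maximum before exponentiating for numerical stability.
    q_max = max(q_values)
    exps = [math.exp((q - q_max) / temperature) for q in q_values]
    total = sum(exps)
    return [e / total for e in exps]


def sample_action(q_values, rng, temperature=1.0):
    """Draw one action index according to the softmax probabilities."""
    probs = softmax_policy(q_values, temperature)
    return rng.choices(range(len(q_values)), weights=probs, k=1)[0]


# Hypothetical opponent action-values for two actions,
# e.g. cooperate (index 0) and defect (index 1).
opponent_q = [1.0, 2.0]
probs = softmax_policy(opponent_q)

rng = random.Random(0)  # seeded generator so runs are repeatable
action = sample_action(opponent_q, rng)
```

Under this assumption the opponent's policy is a smooth function of its Q-values, so an agent that can estimate those Q-values can reason about how its own behavior shifts the opponent's action probabilities.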

Comments:	Accepted to ICLR; not yet in the proceedings
Subjects:	Computer Science and Game Theory (cs.GT); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2405.01035 [cs.GT]
	(or arXiv:2405.01035v1 [cs.GT] for this version)
	https://doi.org/10.48550/arXiv.2405.01035

Submission history

From: Milad Aghajohari
[v1] Thu, 2 May 2024 06:33:01 UTC (708 KB)
