Title: LOQA: Learning with Opponent Q-Learning Awareness

May 02, 2024

Milad Aghajohari, Juan Agustin Duque, Tim Cooijmans, Aaron Courville

Figures 1-4 for LOQA: Learning with Opponent Q-Learning Awareness

Abstract: In various real-world scenarios, interactions among agents often resemble the dynamics of general-sum games, where each agent strives to optimize its own utility. Despite the ubiquitous relevance of such settings, decentralized machine learning algorithms have struggled to find equilibria that maximize individual utility while preserving social welfare. In this paper we introduce Learning with Opponent Q-Learning Awareness (LOQA), a novel, decentralized reinforcement learning algorithm tailored to optimizing an agent's individual utility while fostering cooperation among adversaries in partially competitive environments. LOQA assumes the opponent samples actions proportionally to their action-value function Q. Experimental results demonstrate the effectiveness of LOQA at achieving state-of-the-art performance in benchmark scenarios such as the Iterated Prisoner's Dilemma and the Coin Game. LOQA achieves these outcomes with a significantly reduced computational footprint, making it a promising approach for practical multi-agent applications.
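The abstract's modeling assumption, that the opponent samples actions in proportion to its action-value function Q, can be illustrated with a small sketch. This is not the authors' implementation: it assumes "proportionally" means a softmax over Q-values, and the function names, the `temperature` parameter, and the example Q-values are all hypothetical.

```python
import math
import random


def opponent_policy(q_values, temperature=1.0):
    """Map action values to action probabilities via a softmax (assumed form)."""
    # Subtracting the max keeps exp() numerically stable without changing the result.
    q_max = max(q_values)
    weights = [math.exp((q - q_max) / temperature) for q in q_values]
    total = sum(weights)
    return [w / total for w in weights]


def sample_action(q_values, temperature=1.0, rng=random):
    """Draw one action index according to the softmax probabilities."""
    probs = opponent_policy(q_values, temperature)
    return rng.choices(range(len(q_values)), weights=probs, k=1)[0]


# Hypothetical Q-values for the actions [cooperate, defect] in a single state.
q = [1.0, 2.0]
probs = opponent_policy(q)
action = sample_action(q)
```

Under this assumption, actions with higher Q-values are sampled more often, and the opponent's behavior stays a smooth function of its Q-values, which is what makes it possible to reason about how one's own actions shift the opponent's policy.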

* Accepted to ICLR; not yet in the proceedings: https://openreview.net/forum?id=FDQF6A1s6M
