Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Ashkan Zehfroosh

PAC Reinforcement Learning Algorithm for General-Sum Markov Games

Sep 05, 2020

Ashkan Zehfroosh, Herbert G. Tanner

Figure 1 for PAC Reinforcement Learning Algorithm for General-Sum Markov Games

Figure 2 for PAC Reinforcement Learning Algorithm for General-Sum Markov Games

Figure 3 for PAC Reinforcement Learning Algorithm for General-Sum Markov Games

Abstract:This paper presents a theoretical framework for probably approximately correct (PAC) multi-agent reinforcement learning (MARL) algorithms for Markov games. The paper offers an extension to the well-known Nash Q-learning algorithm, using the idea of delayed Q-learning, in order to build a new PAC MARL algorithm for general-sum Markov games. In addition to guiding the design of a provably PAC MARL algorithm, the framework enables checking whether an arbitrary MARL algorithm is PAC. Comparative numerical results demonstrate performance and robustness.

Via

Access Paper or Ask Questions

A Hybrid PAC Reinforcement Learning Algorithm

Sep 05, 2020

Ashkan Zehfroosh, Herbert G. Tanner

Figure 1 for A Hybrid PAC Reinforcement Learning Algorithm

Figure 2 for A Hybrid PAC Reinforcement Learning Algorithm

Abstract:This paper offers a new hybrid probably asymptotically correct (PAC) reinforcement learning (RL) algorithm for Markov decision processes (MDPs) that intelligently maintains favorable features of its parents. The designed algorithm, referred to as the Dyna-Delayed Q-learning (DDQ) algorithm, combines model-free and model-based learning approaches while outperforming both in most cases. The paper includes a PAC analysis of the DDQ algorithm and a derivation of its sample complexity. Numerical results that support the claim regarding the new algorithm's sample efficiency compared to its parents are showcased in a small grid-world example.

Via

Access Paper or Ask Questions