Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Model-Based Decentralized Policy Optimization

Feb 16, 2023

Hao Luo, Jiechuan Jiang, Zongqing Lu

Figure 1 for Model-Based Decentralized Policy Optimization

Figure 2 for Model-Based Decentralized Policy Optimization

Figure 3 for Model-Based Decentralized Policy Optimization

Figure 4 for Model-Based Decentralized Policy Optimization

Share this with someone who'll enjoy it:

Abstract:Decentralized policy optimization has been commonly used in cooperative multi-agent tasks. However, since all agents are updating their policies simultaneously, from the perspective of individual agents, the environment is non-stationary, resulting in it being hard to guarantee monotonic policy improvement. To help the policy improvement be stable and monotonic, we propose model-based decentralized policy optimization (MDPO), which incorporates a latent variable function to help construct the transition and reward function from an individual perspective. We theoretically analyze that the policy optimization of MDPO is more stable than model-free decentralized policy optimization. Moreover, due to non-stationarity, the latent variable function is varying and hard to be modeled. We further propose a latent variable prediction method to reduce the error of the latent variable function, which theoretically contributes to the monotonic policy improvement. Empirically, MDPO can indeed obtain superior performance than model-free decentralized policy optimization in a variety of cooperative multi-agent tasks.

* 24 pages

View paper on

OpenReview

Share this with someone who'll enjoy it:

Title:Model-Based Decentralized Policy Optimization

Paper and Code