Picture for Stefanos Leonardos

Stefanos Leonardos

AlberDICE: Addressing Out-Of-Distribution Joint Actions in Offline Multi-Agent RL via Alternating Stationary Distribution Correction Estimation

Add code
Nov 03, 2023
Viaarxiv icon

Exploration-Exploitation in Multi-Agent Competition: Convergence with Bounded Rationality

Add code
Jun 24, 2021
Figure 1 for Exploration-Exploitation in Multi-Agent Competition: Convergence with Bounded Rationality
Figure 2 for Exploration-Exploitation in Multi-Agent Competition: Convergence with Bounded Rationality
Figure 3 for Exploration-Exploitation in Multi-Agent Competition: Convergence with Bounded Rationality
Figure 4 for Exploration-Exploitation in Multi-Agent Competition: Convergence with Bounded Rationality
Viaarxiv icon

Global Convergence of Multi-Agent Policy Gradient in Markov Potential Games

Add code
Jun 03, 2021
Figure 1 for Global Convergence of Multi-Agent Policy Gradient in Markov Potential Games
Figure 2 for Global Convergence of Multi-Agent Policy Gradient in Markov Potential Games
Figure 3 for Global Convergence of Multi-Agent Policy Gradient in Markov Potential Games
Figure 4 for Global Convergence of Multi-Agent Policy Gradient in Markov Potential Games
Viaarxiv icon