Picture for Anas Barakat

Anas Barakat

S2A, IDS, LTCI

Independent Policy Mirror Descent for Markov Potential Games: Scaling to Large Number of Players

Add code
Aug 15, 2024
Viaarxiv icon

Policy Mirror Descent with Lookahead

Add code
Mar 21, 2024
Figure 1 for Policy Mirror Descent with Lookahead
Figure 2 for Policy Mirror Descent with Lookahead
Figure 3 for Policy Mirror Descent with Lookahead
Viaarxiv icon

Independent Learning in Constrained Markov Potential Games

Add code
Feb 27, 2024
Figure 1 for Independent Learning in Constrained Markov Potential Games
Figure 2 for Independent Learning in Constrained Markov Potential Games
Figure 3 for Independent Learning in Constrained Markov Potential Games
Figure 4 for Independent Learning in Constrained Markov Potential Games
Viaarxiv icon

Learning Zero-Sum Linear Quadratic Games with Improved Sample Complexity

Add code
Sep 08, 2023
Figure 1 for Learning Zero-Sum Linear Quadratic Games with Improved Sample Complexity
Figure 2 for Learning Zero-Sum Linear Quadratic Games with Improved Sample Complexity
Viaarxiv icon

Reinforcement Learning with General Utilities: Simpler Variance Reduction and Large State-Action Space

Add code
Jun 02, 2023
Figure 1 for Reinforcement Learning with General Utilities: Simpler Variance Reduction and Large State-Action Space
Viaarxiv icon

Stochastic Policy Gradient Methods: Improved Sample Complexity for Fisher-non-degenerate Policies

Add code
Feb 03, 2023
Figure 1 for Stochastic Policy Gradient Methods: Improved Sample Complexity for Fisher-non-degenerate Policies
Figure 2 for Stochastic Policy Gradient Methods: Improved Sample Complexity for Fisher-non-degenerate Policies
Figure 3 for Stochastic Policy Gradient Methods: Improved Sample Complexity for Fisher-non-degenerate Policies
Figure 4 for Stochastic Policy Gradient Methods: Improved Sample Complexity for Fisher-non-degenerate Policies
Viaarxiv icon

Analysis of a Target-Based Actor-Critic Algorithm with Linear Function Approximation

Add code
Jun 14, 2021
Figure 1 for Analysis of a Target-Based Actor-Critic Algorithm with Linear Function Approximation
Viaarxiv icon

Convergence Analysis of a Momentum Algorithm with Adaptive Step Size for Non Convex Optimization

Add code
Nov 18, 2019
Viaarxiv icon