Picture for Anas Barakat

Anas Barakat

S2A, IDS, LTCI

On the Sample Complexity of a Policy Gradient Algorithm with Occupancy Approximation for General Utility Reinforcement Learning

Add code
Oct 05, 2024
Viaarxiv icon

Beyond Expected Returns: A Policy Gradient Algorithm for Cumulative Prospect Theoretic Reinforcement Learning

Add code
Oct 03, 2024
Viaarxiv icon

Independent Policy Mirror Descent for Markov Potential Games: Scaling to Large Number of Players

Add code
Aug 15, 2024
Viaarxiv icon

Policy Mirror Descent with Lookahead

Add code
Mar 21, 2024
Viaarxiv icon

Independent Learning in Constrained Markov Potential Games

Add code
Feb 27, 2024
Viaarxiv icon

Learning Zero-Sum Linear Quadratic Games with Improved Sample Complexity

Add code
Sep 08, 2023
Viaarxiv icon

Reinforcement Learning with General Utilities: Simpler Variance Reduction and Large State-Action Space

Add code
Jun 02, 2023
Viaarxiv icon

Stochastic Policy Gradient Methods: Improved Sample Complexity for Fisher-non-degenerate Policies

Add code
Feb 03, 2023
Viaarxiv icon

Analysis of a Target-Based Actor-Critic Algorithm with Linear Function Approximation

Add code
Jun 14, 2021
Figure 1 for Analysis of a Target-Based Actor-Critic Algorithm with Linear Function Approximation
Viaarxiv icon

Convergence Analysis of a Momentum Algorithm with Adaptive Step Size for Non Convex Optimization

Add code
Nov 18, 2019
Viaarxiv icon