Washim Uddin Mondal

Last-Iterate Convergence of General Parameterized Policies in Constrained MDPs

Aug 21, 2024
Via arXiv

Sample-Efficient Constrained Reinforcement Learning with General Parameterization

May 17, 2024
Via arXiv

Variance-Reduced Policy Gradient Approaches for Infinite Horizon Average Reward Markov Decision Processes

Apr 02, 2024
Via arXiv

Learning General Parameterized Policies for Infinite Horizon Average Reward Constrained MDPs via Primal-Dual Policy Gradient Algorithm

Feb 03, 2024
Via arXiv

Improved Sample Complexity Analysis of Natural Policy Gradient Algorithm with General Parameterization for Infinite Horizon Discounted Reward Markov Decision Processes

Oct 18, 2023
Via arXiv

Regret Analysis of Policy Gradient Algorithm for Infinite Horizon Average Reward Markov Decision Processes

Sep 05, 2023
Via arXiv

Cooperating Graph Neural Networks with Deep Reinforcement Learning for Vaccine Prioritization

May 09, 2023
Via arXiv

Reinforcement Learning with Delayed, Composite, and Partially Anonymous Reward

May 04, 2023
Via arXiv

Mean-Field Control based Approximation of Multi-Agent Reinforcement Learning in Presence of a Non-decomposable Shared Global State

Jan 13, 2023
Via arXiv

Mean-Field Approximation of Cooperative Constrained Multi-Agent Reinforcement Learning (CMARL)

Sep 15, 2022
Via arXiv