Navdeep Kumar

Optimal Sample Complexity for Single Time-Scale Actor-Critic with Momentum

Feb 02, 2026

Policy Gradient with Tree Search: Avoiding Local Optimas through Lookahead

Jun 08, 2025

Dual Formulation for Non-Rectangular Lp Robust Markov Decision Processes

Feb 13, 2025

Improved Sample Complexity for Global Convergence of Actor-Critic Algorithms

Oct 11, 2024

On the Global Convergence of Policy Gradient in Average Reward Markov Decision Processes

Mar 11, 2024

Solving Non-Rectangular Reward-Robust MDPs via Frequency Regularization

Sep 03, 2023

Robust Reinforcement Learning via Adversarial Kernel Approximation

Jun 09, 2023

Policy Gradient for s-Rectangular Robust Markov Decision Processes

Jan 31, 2023

An Efficient Solution to s-Rectangular Robust Markov Decision Processes

Jan 31, 2023

Policy Gradient for Reinforcement Learning with General Utilities

Oct 03, 2022