P. R. Kumar

Linear Convergence of Independent Natural Policy Gradient in Games with Entropy Regularization

May 04, 2024
Via arXiv

Provable Policy Gradient Methods for Average-Reward Markov Potential Games

Mar 09, 2024
Via arXiv

Provably Fast Convergence of Independent Natural Policy Gradient for Markov Potential Games

Oct 27, 2023
Via arXiv

Value-Biased Maximum Likelihood Estimation for Model-based Reinforcement Learning in Discounted Linear MDPs

Oct 17, 2023
Via arXiv

Natural Actor-Critic for Robust Reinforcement Learning with Function Approximation

Jul 17, 2023
Via arXiv

Finite Time Regret Bounds for Minimum Variance Control of Autoregressive Systems with Exogenous Inputs

May 26, 2023
Via arXiv

Recommender system as an exploration coordinator: a bounded O(1) regret algorithm for large platforms

Jan 29, 2023
Via arXiv

TERRA: Beam Management for Outdoor mm-Wave Networks

Jan 10, 2023
Via arXiv

Energy System Digitization in the Era of AI: A Three-Layered Approach towards Carbon Neutrality

Nov 02, 2022
Via arXiv

Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective Reinforcement Learning

Jun 10, 2022
Via arXiv