Picture for Siva Theja Maguluri

Siva Theja Maguluri

Stochastic Approximation with Unbounded Markovian Noise: A General-Purpose Theorem

Add code
Oct 29, 2024
Viaarxiv icon

Markov Chain Variance Estimation: A Stochastic Approximation Approach

Add code
Sep 09, 2024
Figure 1 for Markov Chain Variance Estimation: A Stochastic Approximation Approach
Viaarxiv icon

Performance of NPG in Countable State-Space Average-Cost RL

Add code
May 30, 2024
Viaarxiv icon

Convergence for Natural Policy Gradient on Infinite-State Average-Reward Markov Decision Processes

Add code
Feb 07, 2024
Viaarxiv icon

Tight Finite Time Bounds of Two-Time-Scale Linear Stochastic Approximation with Markovian Noise

Add code
Dec 31, 2023
Viaarxiv icon

Concentration of Contractive Stochastic Approximation: Additive and Multiplicative Noise

Add code
Mar 28, 2023
Viaarxiv icon

Sample Complexity of Policy-Based Methods under Off-Policy Sampling and Linear Function Approximation

Add code
Aug 05, 2022
Viaarxiv icon

Federated Reinforcement Learning: Linear Speedup Under Markovian Sampling

Add code
Jun 21, 2022
Figure 1 for Federated Reinforcement Learning: Linear Speedup Under Markovian Sampling
Figure 2 for Federated Reinforcement Learning: Linear Speedup Under Markovian Sampling
Viaarxiv icon

Target Network and Truncation Overcome The Deadly triad in $Q$-Learning

Add code
Mar 05, 2022
Figure 1 for Target Network and Truncation Overcome The Deadly triad in $Q$-Learning
Figure 2 for Target Network and Truncation Overcome The Deadly triad in $Q$-Learning
Figure 3 for Target Network and Truncation Overcome The Deadly triad in $Q$-Learning
Figure 4 for Target Network and Truncation Overcome The Deadly triad in $Q$-Learning
Viaarxiv icon

Stationary Behavior of Constant Stepsize SGD Type Algorithms: An Asymptotic Characterization

Add code
Nov 11, 2021
Figure 1 for Stationary Behavior of Constant Stepsize SGD Type Algorithms: An Asymptotic Characterization
Figure 2 for Stationary Behavior of Constant Stepsize SGD Type Algorithms: An Asymptotic Characterization
Figure 3 for Stationary Behavior of Constant Stepsize SGD Type Algorithms: An Asymptotic Characterization
Figure 4 for Stationary Behavior of Constant Stepsize SGD Type Algorithms: An Asymptotic Characterization
Viaarxiv icon