Prashanth L. A

Generalized Simultaneous Perturbation Stochastic Approximation with Reduced Estimator Bias

Dec 20, 2022
Via arXiv

Approximate gradient ascent methods for distortion risk measures

Feb 22, 2022
Via arXiv

Likelihood ratio-based policy gradient methods for distorted risk measures: A non-asymptotic analysis

Jul 14, 2021
Via arXiv

Smoothed functional-based gradient algorithms for off-policy reinforcement learning

Jan 06, 2021
Via arXiv

Improved Concentration Bounds for Conditional Value-at-Risk and Cumulative Prospect Theory using Wasserstein distance

Feb 27, 2019
Via arXiv

Correlated bandits or: How to minimize mean-squared error online

Feb 08, 2019
Via arXiv