L. A. Prashanth

Concentration Bounds for Optimized Certainty Equivalent Risk Estimation

May 31, 2024
Via arXiv

Stochastic approximation for speeding up LSTD (and LSPI)

Nov 28, 2017
Via arXiv

Weighted bandits or: How bandits learn distorted values that are not expected

Nov 30, 2016
Via arXiv

On TD(0) with function approximation: Concentration bounds and a centered variant with exponential convergence

Sep 01, 2015
Via arXiv

Actor-Critic Algorithms for Learning Nash Equilibria in N-player General-Sum Games

Jul 02, 2015
Via arXiv

Simultaneous Perturbation Algorithms for Batch Off-Policy Search

Mar 31, 2014
Via arXiv