Nathaniel Korda

Stochastic approximation for speeding up LSTD (and LSPI)

Nov 28, 2017

On TD(0) with function approximation: Concentration bounds and a centered variant with exponential convergence

Sep 01, 2015

Fast gradient descent for drifting least squares regression, with application to bandits

Nov 20, 2014

Finite-Time Analysis of Kernelised Contextual Bandits

Sep 26, 2013

Thompson Sampling for 1-Dimensional Exponential Family Bandits

Jul 12, 2013

Thompson Sampling: An Asymptotically Optimal Finite Time Analysis

Jul 19, 2012