Picture for Bruno Scherrer

Bruno Scherrer

BIGS

Infinite-Horizon Offline Reinforcement Learning with Linear Function Approximation: Curse of Dimensionality and Algorithm

Add code
Mar 17, 2021
Figure 1 for Infinite-Horizon Offline Reinforcement Learning with Linear Function Approximation: Curse of Dimensionality and Algorithm
Viaarxiv icon

Leverage the Average: an Analysis of Regularization in RL

Add code
Apr 10, 2020
Figure 1 for Leverage the Average: an Analysis of Regularization in RL
Figure 2 for Leverage the Average: an Analysis of Regularization in RL
Figure 3 for Leverage the Average: an Analysis of Regularization in RL
Figure 4 for Leverage the Average: an Analysis of Regularization in RL
Viaarxiv icon

Momentum in Reinforcement Learning

Add code
Oct 21, 2019
Figure 1 for Momentum in Reinforcement Learning
Figure 2 for Momentum in Reinforcement Learning
Figure 3 for Momentum in Reinforcement Learning
Figure 4 for Momentum in Reinforcement Learning
Viaarxiv icon

A Theory of Regularized Markov Decision Processes

Add code
Jan 31, 2019
Viaarxiv icon

Anderson Acceleration for Reinforcement Learning

Add code
Sep 25, 2018
Figure 1 for Anderson Acceleration for Reinforcement Learning
Figure 2 for Anderson Acceleration for Reinforcement Learning
Viaarxiv icon

Multiple-Step Greedy Policies in Online and Approximate Reinforcement Learning

Add code
Sep 20, 2018
Figure 1 for Multiple-Step Greedy Policies in Online and Approximate Reinforcement Learning
Viaarxiv icon

How to Combine Tree-Search Methods in Reinforcement Learning

Add code
Sep 06, 2018
Figure 1 for How to Combine Tree-Search Methods in Reinforcement Learning
Figure 2 for How to Combine Tree-Search Methods in Reinforcement Learning
Figure 3 for How to Combine Tree-Search Methods in Reinforcement Learning
Figure 4 for How to Combine Tree-Search Methods in Reinforcement Learning
Viaarxiv icon

Beyond the One Step Greedy Approach in Reinforcement Learning

Add code
Jul 30, 2018
Figure 1 for Beyond the One Step Greedy Approach in Reinforcement Learning
Viaarxiv icon

Improved and Generalized Upper Bounds on the Complexity of Policy Iteration

Add code
Feb 10, 2016
Viaarxiv icon

Rate of Convergence and Error Bounds for LSTD

Add code
May 13, 2014
Figure 1 for Rate of Convergence and Error Bounds for LSTD
Viaarxiv icon