Picture for Sean P. Meyn

Sean P. Meyn

Convex Q-Learning, Part 1: Deterministic Optimal Control

Add code
Aug 08, 2020
Viaarxiv icon

Q-learning with Uniformly Bounded Variance: Large Discounting is Not a Barrier to Fast Learning

Add code
Feb 24, 2020
Figure 1 for Q-learning with Uniformly Bounded Variance: Large Discounting is Not a Barrier to Fast Learning
Figure 2 for Q-learning with Uniformly Bounded Variance: Large Discounting is Not a Barrier to Fast Learning
Viaarxiv icon

Zap Q-Learning for Optimal Stopping Time Problems

Add code
May 01, 2019
Figure 1 for Zap Q-Learning for Optimal Stopping Time Problems
Viaarxiv icon

Differential Temporal Difference Learning

Add code
Dec 28, 2018
Figure 1 for Differential Temporal Difference Learning
Figure 2 for Differential Temporal Difference Learning
Figure 3 for Differential Temporal Difference Learning
Figure 4 for Differential Temporal Difference Learning
Viaarxiv icon

Fastest Convergence for Q-learning

Add code
Mar 21, 2018
Figure 1 for Fastest Convergence for Q-learning
Figure 2 for Fastest Convergence for Q-learning
Figure 3 for Fastest Convergence for Q-learning
Figure 4 for Fastest Convergence for Q-learning
Viaarxiv icon

Differential TD Learning for Value Function Approximation

Add code
Apr 06, 2016
Figure 1 for Differential TD Learning for Value Function Approximation
Figure 2 for Differential TD Learning for Value Function Approximation
Figure 3 for Differential TD Learning for Value Function Approximation
Figure 4 for Differential TD Learning for Value Function Approximation
Viaarxiv icon