Q-learning with Uniformly Bounded Variance: Large Discounting is Not a Barrier to Fast Learning

Add code
Feb 24, 2020
Figure 1 for Q-learning with Uniformly Bounded Variance: Large Discounting is Not a Barrier to Fast Learning
Figure 2 for Q-learning with Uniformly Bounded Variance: Large Discounting is Not a Barrier to Fast Learning

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: