Picture for Kenny Young

Kenny Young

Sequence Compression Speeds Up Credit Assignment in Reinforcement Learning

Add code
May 06, 2024
Viaarxiv icon

Iterative Option Discovery for Planning, by Planning

Add code
Oct 02, 2023
Viaarxiv icon

The Benefits of Model-Based Generalization in Reinforcement Learning

Add code
Nov 04, 2022
Viaarxiv icon

Doubly-Asynchronous Value Iteration: Making Value Iteration Asynchronous in Actions

Add code
Jul 04, 2022
Figure 1 for Doubly-Asynchronous Value Iteration: Making Value Iteration Asynchronous in Actions
Figure 2 for Doubly-Asynchronous Value Iteration: Making Value Iteration Asynchronous in Actions
Figure 3 for Doubly-Asynchronous Value Iteration: Making Value Iteration Asynchronous in Actions
Figure 4 for Doubly-Asynchronous Value Iteration: Making Value Iteration Asynchronous in Actions
Viaarxiv icon

Hindsight Network Credit Assignment: Efficient Credit Assignment in Networks of Discrete Stochastic Units

Add code
Oct 14, 2021
Figure 1 for Hindsight Network Credit Assignment: Efficient Credit Assignment in Networks of Discrete Stochastic Units
Figure 2 for Hindsight Network Credit Assignment: Efficient Credit Assignment in Networks of Discrete Stochastic Units
Figure 3 for Hindsight Network Credit Assignment: Efficient Credit Assignment in Networks of Discrete Stochastic Units
Figure 4 for Hindsight Network Credit Assignment: Efficient Credit Assignment in Networks of Discrete Stochastic Units
Viaarxiv icon

Hindsight Network Credit Assignment

Add code
Nov 24, 2020
Figure 1 for Hindsight Network Credit Assignment
Figure 2 for Hindsight Network Credit Assignment
Figure 3 for Hindsight Network Credit Assignment
Viaarxiv icon

Understanding the Pathologies of Approximate Policy Evaluation when Combined with Greedification in Reinforcement Learning

Add code
Oct 28, 2020
Figure 1 for Understanding the Pathologies of Approximate Policy Evaluation when Combined with Greedification in Reinforcement Learning
Figure 2 for Understanding the Pathologies of Approximate Policy Evaluation when Combined with Greedification in Reinforcement Learning
Figure 3 for Understanding the Pathologies of Approximate Policy Evaluation when Combined with Greedification in Reinforcement Learning
Figure 4 for Understanding the Pathologies of Approximate Policy Evaluation when Combined with Greedification in Reinforcement Learning
Viaarxiv icon

Variance Reduced Advantage Estimation with $δ$ Hindsight Credit Assignment

Add code
Jan 09, 2020
Figure 1 for Variance Reduced Advantage Estimation with $δ$ Hindsight Credit Assignment
Viaarxiv icon

MinAtar: An Atari-inspired Testbed for More Efficient Reinforcement Learning Experiments

Add code
Mar 07, 2019
Figure 1 for MinAtar: An Atari-inspired Testbed for More Efficient Reinforcement Learning Experiments
Figure 2 for MinAtar: An Atari-inspired Testbed for More Efficient Reinforcement Learning Experiments
Figure 3 for MinAtar: An Atari-inspired Testbed for More Efficient Reinforcement Learning Experiments
Viaarxiv icon

Metatrace: Online Step-size Tuning by Meta-gradient Descent for Reinforcement Learning Control

Add code
May 10, 2018
Figure 1 for Metatrace: Online Step-size Tuning by Meta-gradient Descent for Reinforcement Learning Control
Figure 2 for Metatrace: Online Step-size Tuning by Meta-gradient Descent for Reinforcement Learning Control
Figure 3 for Metatrace: Online Step-size Tuning by Meta-gradient Descent for Reinforcement Learning Control
Figure 4 for Metatrace: Online Step-size Tuning by Meta-gradient Descent for Reinforcement Learning Control
Viaarxiv icon