Picture for Chris Nota

Chris Nota

On the Convergence of Discounted Policy Gradient Methods

Add code
Jan 09, 2023
Viaarxiv icon

Learning Reusable Options for Multi-Task Reinforcement Learning

Add code
Jan 06, 2020
Figure 1 for Learning Reusable Options for Multi-Task Reinforcement Learning
Figure 2 for Learning Reusable Options for Multi-Task Reinforcement Learning
Figure 3 for Learning Reusable Options for Multi-Task Reinforcement Learning
Figure 4 for Learning Reusable Options for Multi-Task Reinforcement Learning
Viaarxiv icon

Is the Policy Gradient a Gradient?

Add code
Jun 17, 2019
Figure 1 for Is the Policy Gradient a Gradient?
Figure 2 for Is the Policy Gradient a Gradient?
Figure 3 for Is the Policy Gradient a Gradient?
Viaarxiv icon

Classical Policy Gradient: Preserving Bellman's Principle of Optimality

Add code
Jun 06, 2019
Viaarxiv icon

Lifelong Learning with a Changing Action Set

Add code
Jun 05, 2019
Figure 1 for Lifelong Learning with a Changing Action Set
Figure 2 for Lifelong Learning with a Changing Action Set
Figure 3 for Lifelong Learning with a Changing Action Set
Figure 4 for Lifelong Learning with a Changing Action Set
Viaarxiv icon

Asynchronous Coagent Networks: Stochastic Networks for Reinforcement Learning without Backpropagation or a Clock

Add code
Feb 21, 2019
Figure 1 for Asynchronous Coagent Networks: Stochastic Networks for Reinforcement Learning without Backpropagation or a Clock
Figure 2 for Asynchronous Coagent Networks: Stochastic Networks for Reinforcement Learning without Backpropagation or a Clock
Figure 3 for Asynchronous Coagent Networks: Stochastic Networks for Reinforcement Learning without Backpropagation or a Clock
Figure 4 for Asynchronous Coagent Networks: Stochastic Networks for Reinforcement Learning without Backpropagation or a Clock
Viaarxiv icon