
James Kostas

Classical Policy Gradient: Preserving Bellman's Principle of Optimality

Jun 06, 2019
Via arXiv

Asynchronous Coagent Networks: Stochastic Networks for Reinforcement Learning without Backpropagation or a Clock

Feb 21, 2019
Via arXiv

Learning Action Representations for Reinforcement Learning

Feb 01, 2019
Via arXiv