Picture for Keith W. Ross

Keith W. Ross

Aggressive Q-Learning with Ensembles: Achieving Both High Sample Efficiency and High Asymptotic Performance

Add code
Nov 17, 2021
Figure 1 for Aggressive Q-Learning with Ensembles: Achieving Both High Sample Efficiency and High Asymptotic Performance
Figure 2 for Aggressive Q-Learning with Ensembles: Achieving Both High Sample Efficiency and High Asymptotic Performance
Figure 3 for Aggressive Q-Learning with Ensembles: Achieving Both High Sample Efficiency and High Asymptotic Performance
Figure 4 for Aggressive Q-Learning with Ensembles: Achieving Both High Sample Efficiency and High Asymptotic Performance
Viaarxiv icon

On-Policy Deep Reinforcement Learning for the Average-Reward Criterion

Add code
Jun 14, 2021
Figure 1 for On-Policy Deep Reinforcement Learning for the Average-Reward Criterion
Figure 2 for On-Policy Deep Reinforcement Learning for the Average-Reward Criterion
Figure 3 for On-Policy Deep Reinforcement Learning for the Average-Reward Criterion
Figure 4 for On-Policy Deep Reinforcement Learning for the Average-Reward Criterion
Viaarxiv icon

First Order Optimization in Policy Space for Constrained Deep Reinforcement Learning

Add code
Feb 16, 2020
Figure 1 for First Order Optimization in Policy Space for Constrained Deep Reinforcement Learning
Figure 2 for First Order Optimization in Policy Space for Constrained Deep Reinforcement Learning
Figure 3 for First Order Optimization in Policy Space for Constrained Deep Reinforcement Learning
Figure 4 for First Order Optimization in Policy Space for Constrained Deep Reinforcement Learning
Viaarxiv icon

Supervised Policy Update for Deep Reinforcement Learning

Add code
Dec 24, 2018
Figure 1 for Supervised Policy Update for Deep Reinforcement Learning
Figure 2 for Supervised Policy Update for Deep Reinforcement Learning
Figure 3 for Supervised Policy Update for Deep Reinforcement Learning
Figure 4 for Supervised Policy Update for Deep Reinforcement Learning
Viaarxiv icon

Efficient Entropy for Policy Gradient with Multidimensional Action Space

Add code
Jun 02, 2018
Figure 1 for Efficient Entropy for Policy Gradient with Multidimensional Action Space
Figure 2 for Efficient Entropy for Policy Gradient with Multidimensional Action Space
Figure 3 for Efficient Entropy for Policy Gradient with Multidimensional Action Space
Figure 4 for Efficient Entropy for Policy Gradient with Multidimensional Action Space
Viaarxiv icon