Longxiang Shi

FiDi-RL: Incorporating Deep Reinforcement Learning with Finite-Difference Policy Search for Efficient Learning of Continuous Control

Jul 10, 2019
Via arXiv

TBQ($σ$): Improving Efficiency of Trace Utilization for Off-Policy Reinforcement Learning

May 17, 2019
Via arXiv