Picture for Tianbing Xu

Tianbing Xu

WALL-E: An Efficient Reinforcement Learning Research Framework

Add code
Jan 28, 2019
Figure 1 for WALL-E: An Efficient Reinforcement Learning Research Framework
Figure 2 for WALL-E: An Efficient Reinforcement Learning Research Framework
Figure 3 for WALL-E: An Efficient Reinforcement Learning Research Framework
Figure 4 for WALL-E: An Efficient Reinforcement Learning Research Framework
Viaarxiv icon

Stochastic Variance Reduction for Policy Gradient Estimation

Add code
Mar 29, 2018
Figure 1 for Stochastic Variance Reduction for Policy Gradient Estimation
Figure 2 for Stochastic Variance Reduction for Policy Gradient Estimation
Figure 3 for Stochastic Variance Reduction for Policy Gradient Estimation
Figure 4 for Stochastic Variance Reduction for Policy Gradient Estimation
Viaarxiv icon

Learning to Explore with Meta-Policy Gradient

Add code
Mar 26, 2018
Figure 1 for Learning to Explore with Meta-Policy Gradient
Figure 2 for Learning to Explore with Meta-Policy Gradient
Figure 3 for Learning to Explore with Meta-Policy Gradient
Figure 4 for Learning to Explore with Meta-Policy Gradient
Viaarxiv icon

Variational Inference for Policy Gradient

Add code
Mar 25, 2018
Viaarxiv icon

Thompson Sampling in Dynamic Systems for Contextual Bandit Problems

Add code
Oct 17, 2013
Figure 1 for Thompson Sampling in Dynamic Systems for Contextual Bandit Problems
Figure 2 for Thompson Sampling in Dynamic Systems for Contextual Bandit Problems
Figure 3 for Thompson Sampling in Dynamic Systems for Contextual Bandit Problems
Figure 4 for Thompson Sampling in Dynamic Systems for Contextual Bandit Problems
Viaarxiv icon

Online Classification Using a Voted RDA Method

Add code
Oct 17, 2013
Figure 1 for Online Classification Using a Voted RDA Method
Figure 2 for Online Classification Using a Voted RDA Method
Figure 3 for Online Classification Using a Voted RDA Method
Figure 4 for Online Classification Using a Voted RDA Method
Viaarxiv icon