Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model

Jul 01, 2019

Alex X. Lee, Anusha Nagabandi, Pieter Abbeel, Sergey Levine

Figure 1 for Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model

Figure 2 for Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model

Figure 3 for Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model

Figure 4 for Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model

Share this with someone who'll enjoy it:

Abstract:Deep reinforcement learning (RL) algorithms can use high-capacity deep networks to learn directly from image observations. However, these kinds of observation spaces present a number of challenges in practice, since the policy must now solve two problems: a representation learning problem, and a task learning problem. In this paper, we aim to explicitly learn representations that can accelerate reinforcement learning from images. We propose the stochastic latent actor-critic (SLAC) algorithm: a sample-efficient and high-performing RL algorithm for learning policies for complex continuous control tasks directly from high-dimensional image inputs. SLAC learns a compact latent representation space using a stochastic sequential latent variable model, and then learns a critic model within this latent space. By learning a critic within a compact state space, SLAC can learn much more efficiently than standard RL methods. The proposed model improves performance substantially over alternative representations as well, such as variational autoencoders. In fact, our experimental evaluation demonstrates that the sample efficiency of our resulting method is comparable to that of model-based RL methods that directly use a similar type of model for control. Furthermore, our method outperforms both model-free and model-based alternatives in terms of final performance and sample efficiency, on a range of difficult image-based control tasks.

* Project website: https://alexlee-gk.github.io/slac/

View paper on

OpenReview

Share this with someone who'll enjoy it:

Title:Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model

Paper and Code