Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Kim Heecheol

Macro Action Reinforcement Learning with Sequence Disentanglement using Variational Autoencoder

Mar 22, 2019

Kim Heecheol, Masanori Yamada, Kosuke Miyoshi, Hiroshi Yamakawa

Figure 1 for Macro Action Reinforcement Learning with Sequence Disentanglement using Variational Autoencoder

Figure 2 for Macro Action Reinforcement Learning with Sequence Disentanglement using Variational Autoencoder

Figure 3 for Macro Action Reinforcement Learning with Sequence Disentanglement using Variational Autoencoder

Figure 4 for Macro Action Reinforcement Learning with Sequence Disentanglement using Variational Autoencoder

Abstract:One problem in the application of reinforcement learning to real-world problems is the curse of dimensionality on the action space. Macro actions, a sequence of primitive actions, have been studied to diminish the dimensionality of the action space with regard to the time axis. However, previous studies relied on humans defining macro actions or assumed macro actions as repetitions of the same primitive actions. We present Factorized Macro Action Reinforcement Learning (FaMARL) which autonomously learns disentangled factor representation of a sequence of actions to generate macro actions that can be directly applied to general reinforcement learning algorithms. FaMARL exhibits higher scores than other reinforcement learning algorithms on environments that require an extensive amount of search.

* First and second authors equally contributed to this paper

Via

Access Paper or Ask Questions

FAVAE: Sequence Disentanglement using Information Bottleneck Principle

Feb 22, 2019

Masanori Yamada, Kim Heecheol, Kosuke Miyoshi, Hiroshi Yamakawa

Figure 1 for FAVAE: Sequence Disentanglement using Information Bottleneck Principle

Figure 2 for FAVAE: Sequence Disentanglement using Information Bottleneck Principle

Figure 3 for FAVAE: Sequence Disentanglement using Information Bottleneck Principle

Figure 4 for FAVAE: Sequence Disentanglement using Information Bottleneck Principle

Abstract:We propose the factorized action variational autoencoder (FAVAE), a state-of-the-art generative model for learning disentangled and interpretable representations from sequential data via the information bottleneck without supervision. The purpose of disentangled representation learning is to obtain interpretable and transferable representations from data. We focused on the disentangled representation of sequential data since there is a wide range of potential applications if disentanglement representation is extended to sequential data such as video, speech, and stock market. Sequential data are characterized by dynamic and static factors: dynamic factors are time dependent, and static factors are independent of time. Previous models disentangle static and dynamic factors by explicitly modeling the priors of latent variables to distinguish between these factors. However, these models cannot disentangle representations between dynamic factors, such as disentangling "picking up" and "throwing" in robotic tasks. FAVAE can disentangle multiple dynamic factors. Since it does not require modeling priors, it can disentangle "between" dynamic factors. We conducted experiments to show that FAVAE can extract disentangled dynamic factors.

Via

Access Paper or Ask Questions