Title: Off-Policy Meta-Reinforcement Learning Based on Feature Embedding Spaces

Jan 06, 2021

Takahisa Imagawa, Takuya Hiraoka, Yoshimasa Tsuruoka

Figures 1-4 for Off-Policy Meta-Reinforcement Learning Based on Feature Embedding Spaces

Abstract: Meta-reinforcement learning (meta-RL) addresses the sample inefficiency of deep RL by reusing experience obtained in past tasks when solving a new task. However, most meta-RL methods require partially or fully on-policy data, i.e., they cannot reuse the data collected by past policies, which limits how much sample efficiency can improve. To alleviate this problem, we propose a novel off-policy meta-RL method, embedding learning and evaluation of uncertainty (ELUE). An ELUE agent is characterized by the learning of a feature embedding space shared among tasks. It learns a belief model over the embedding space, together with a belief-conditional policy and Q-function. Then, for a new task, it collects data with the pretrained policy and updates its belief based on the belief model. Thanks to the belief update, performance can be improved with a small amount of data. In addition, once enough data are available, it updates the parameters of the neural networks to adjust the pretrained relationships. We demonstrate through experiments on meta-RL benchmarks that ELUE outperforms state-of-the-art meta-RL methods.
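The abstract describes a belief over a task embedding that is updated from newly collected data and then fed to a belief-conditional policy. The following is only a minimal sketch of that general idea, not the ELUE belief model from the paper: it assumes a diagonal Gaussian belief and a conjugate Gaussian update, and the names `GaussianBelief`, `belief_after`, `policy_input`, and the noise parameter `obs_var` are all hypothetical.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class GaussianBelief:
    """Diagonal Gaussian belief over a task embedding z.

    Assumption: a generic stand-in, not the belief model used by ELUE.
    """
    mean: List[float]
    var: List[float]

    def update(self, obs: Sequence[float], obs_var: float) -> "GaussianBelief":
        """Conjugate update for one noisy observation of z with variance obs_var."""
        new_mean, new_var = [], []
        for m, v, o in zip(self.mean, self.var, obs):
            precision = 1.0 / v + 1.0 / obs_var       # precisions add
            post_var = 1.0 / precision
            post_mean = post_var * (m / v + o / obs_var)  # precision-weighted mean
            new_mean.append(post_mean)
            new_var.append(post_var)
        return GaussianBelief(new_mean, new_var)


def belief_after(observations: Sequence[Sequence[float]],
                 prior: GaussianBelief,
                 obs_var: float = 1.0) -> GaussianBelief:
    """Fold a batch of (hypothetical) embedding observations into the prior."""
    belief = prior
    for obs in observations:
        belief = belief.update(obs, obs_var)
    return belief


def policy_input(state: Sequence[float], belief: GaussianBelief) -> List[float]:
    """Belief-conditional input: the state concatenated with belief mean and variance."""
    return list(state) + belief.mean + belief.var


if __name__ == "__main__":
    prior = GaussianBelief(mean=[0.0, 0.0], var=[1.0, 1.0])
    posterior = belief_after([[2.0, -2.0], [2.0, -2.0]], prior)
    print(posterior)                       # variance shrinks as data arrive
    print(policy_input([0.1, 0.2, 0.3], posterior))
```

In this sketch the belief variance shrinks with every observation, which mirrors the abstract's point that a few samples from a new task can already sharpen the agent's estimate of which task it is facing.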

* 14 pages

View paper on OpenReview
