A promising characteristic of Deep Reinforcement Learning (DRL) is its capability to learn an optimal policy in an end-to-end manner without relying on feature engineering. However, most approaches assume a fully observable state space, i.e., a fully observable Markov Decision Process (MDP). In real-world robotics, this assumption is impractical, because of sensor issues such as limited sensor capacity and sensor noise, and because it is often unknown whether the observation design is complete. Such scenarios lead to a Partially Observable MDP (POMDP) and require special treatment. In this paper, we propose the Long Short-Term Memory-based Twin Delayed Deep Deterministic Policy Gradient (LSTM-TD3) by introducing a memory component to TD3, and compare its performance with that of other DRL algorithms in both MDPs and POMDPs. Our results demonstrate the significant advantages of the memory component in addressing POMDPs, including the ability to handle missing and noisy observation data.
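As a rough illustration of what "introducing a memory component to TD3" can look like, the sketch below shows an actor network whose LSTM summarizes a history of past observation-action pairs before the feed-forward policy head. It is a minimal sketch assuming a PyTorch implementation; the class name `LSTMActor`, the layer sizes, and the way the history is concatenated with the current observation are illustrative assumptions, not the exact architecture of LSTM-TD3, and the twin critics would be extended with an analogous memory branch.

```python
# Illustrative sketch only: one plausible way to add an LSTM memory to a TD3 actor.
import torch
import torch.nn as nn


class LSTMActor(nn.Module):
    """Deterministic policy conditioned on a history of past
    (observation, action) pairs via an LSTM, plus the current observation."""

    def __init__(self, obs_dim, act_dim, act_limit, mem_hidden=128, mlp_hidden=128):
        super().__init__()
        self.act_limit = act_limit
        # Memory branch: encodes the sequence of past (obs, act) pairs.
        self.memory = nn.LSTM(input_size=obs_dim + act_dim,
                              hidden_size=mem_hidden,
                              batch_first=True)
        # Feed-forward head: combines the memory summary with the current observation.
        self.head = nn.Sequential(
            nn.Linear(mem_hidden + obs_dim, mlp_hidden),
            nn.ReLU(),
            nn.Linear(mlp_hidden, act_dim),
            nn.Tanh(),
        )

    def forward(self, obs, hist_obs, hist_act):
        # hist_obs: (batch, seq_len, obs_dim); hist_act: (batch, seq_len, act_dim)
        hist = torch.cat([hist_obs, hist_act], dim=-1)
        _, (h_n, _) = self.memory(hist)      # h_n: (1, batch, mem_hidden)
        summary = h_n.squeeze(0)             # (batch, mem_hidden)
        x = torch.cat([summary, obs], dim=-1)
        return self.act_limit * self.head(x)  # actions in [-act_limit, act_limit]


# Usage example: a batch of 4 samples, each with a history of 5 past steps.
actor = LSTMActor(obs_dim=8, act_dim=2, act_limit=1.0)
obs = torch.randn(4, 8)
hist_obs = torch.randn(4, 5, 8)
hist_act = torch.randn(4, 5, 2)
print(actor(obs, hist_obs, hist_act).shape)  # torch.Size([4, 2])
```

Because the policy input now includes a learned summary of recent history rather than only the instantaneous observation, the agent can in principle compensate for missing or noisy readings in a single time step, which is the behavior the POMDP experiments are designed to probe.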