Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Informed POMDP: Leveraging Additional Information in Model-Based RL

Jun 24, 2023

Gaspard Lambrechts, Adrien Bolland, Damien Ernst

Figure 1 for Informed POMDP: Leveraging Additional Information in Model-Based RL

Figure 2 for Informed POMDP: Leveraging Additional Information in Model-Based RL

Figure 3 for Informed POMDP: Leveraging Additional Information in Model-Based RL

Figure 4 for Informed POMDP: Leveraging Additional Information in Model-Based RL

Share this with someone who'll enjoy it:

Abstract:In this work, we generalize the problem of learning through interaction in a POMDP by accounting for eventual additional information available at training time. First, we introduce the informed POMDP, a new learning paradigm offering a clear distinction between the training information and the execution observation. Next, we propose an objective for learning a sufficient statistic from the history for the optimal control that leverages this information. We then show that this informed objective consists of learning an environment model from which we can sample latent trajectories. Finally, we show for the Dreamer algorithm that the convergence speed of the policies is sometimes greatly improved on several environments by using this informed environment model. Those results and the simplicity of the proposed adaptation advocate for a systematic consideration of eventual additional information when learning in a POMDP using model-based RL.

* In ICML Workshop on New Frontiers in Learning, Control, and Dynamical Systems, 2023. 8 pages, 13 pages total, 8 figures

View paper on

OpenReview

Share this with someone who'll enjoy it:

Title:Informed POMDP: Leveraging Additional Information in Model-Based RL

Paper and Code