Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Bridging State and History Representations: Understanding Self-Predictive RL

Jan 17, 2024

Tianwei Ni, Benjamin Eysenbach, Erfan Seyedsalehi, Michel Ma, Clement Gehring, Aditya Mahajan, Pierre-Luc Bacon

Figure 1 for Bridging State and History Representations: Understanding Self-Predictive RL

Figure 2 for Bridging State and History Representations: Understanding Self-Predictive RL

Figure 3 for Bridging State and History Representations: Understanding Self-Predictive RL

Figure 4 for Bridging State and History Representations: Understanding Self-Predictive RL

Share this with someone who'll enjoy it:

Abstract:Representations are at the core of all deep reinforcement learning (RL) methods for both Markov decision processes (MDPs) and partially observable Markov decision processes (POMDPs). Many representation learning methods and theoretical frameworks have been developed to understand what constitutes an effective representation. However, the relationships between these methods and the shared properties among them remain unclear. In this paper, we show that many of these seemingly distinct methods and frameworks for state and history abstractions are, in fact, based on a common idea of self-predictive abstraction. Furthermore, we provide theoretical insights into the widely adopted objectives and optimization, such as the stop-gradient technique, in learning self-predictive representations. These findings together yield a minimalist algorithm to learn self-predictive representations for states and histories. We validate our theories by applying our algorithm to standard MDPs, MDPs with distractors, and POMDPs with sparse rewards. These findings culminate in a set of practical guidelines for RL practitioners.

* ICLR 2024 (Poster). Code is available at https://github.com/twni2016/self-predictive-rl

View paper on

Share this with someone who'll enjoy it:

Title:Bridging State and History Representations: Understanding Self-Predictive RL

Paper and Code