Deep reinforcement learning promises that an agent can learn a good policy from high-dimensional observations, while representation learning removes irrelevant and redundant information and retains what is pertinent. We study the representation capacity of the action-value function and theoretically reveal an inherent property, the \textit{representation gap} between it and its target action-value function. This representation gap is favorable. However, through illustrative experiments, we show that the representation of the action-value function tends to evolve similarly to that of its target, i.e., the representation gap remains undesirably inactive (\textit{representation overlap}). Representation overlap results in a loss of representation capacity, which in turn leads to sub-optimal learning performance. To activate the representation gap, we propose a simple but effective framework, \underline{P}olicy \underline{O}ptimization from \underline{P}reventing \underline{R}epresentation \underline{O}verlaps (POPRO), which regularizes the policy evaluation phase by pushing the representation of the action-value function away from that of its target. We also provide a convergence rate guarantee for POPRO. We evaluate POPRO on Gym continuous control suites. The empirical results show that POPRO with pixel inputs matches or exceeds the sample efficiency of methods that use state-based features.
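As a minimal illustrative sketch of the kind of regularized policy-evaluation objective described above (the penalty form, the similarity measure $\mathrm{sim}$, the feature maps $\phi_\theta, \phi_{\bar\theta}$, and the coefficient $\beta$ are assumptions for exposition, not the paper's exact formulation):
\begin{equation*}
\mathcal{L}(\theta) \;=\; \mathbb{E}_{(s,a,r,s')\sim\mathcal{D}}\Big[\big(Q_\theta(s,a) - y\big)^2\Big] \;+\; \beta\,\mathrm{sim}\big(\phi_\theta(s,a),\, \phi_{\bar\theta}(s,a)\big),
\qquad y = r + \gamma\, Q_{\bar\theta}\big(s', \pi(s')\big),
\end{equation*}
where $\phi_\theta$ and $\phi_{\bar\theta}$ denote the representations (e.g., penultimate-layer features) of the online and target action-value networks, $\mathrm{sim}$ is a similarity measure such as cosine similarity, and $\beta > 0$ penalizes representation overlap so that the representation gap stays active during policy evaluation.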