Nathan Michlo

Accounting for the Sequential Nature of States to Learn Features for Reinforcement Learning

May 12, 2022

Nathan Michlo, Devon Jarvis, Richard Klein, Steven James

Abstract: In this work, we investigate the properties of data that cause popular representation learning approaches to fail. In particular, we find that in environments where states do not significantly overlap, variational autoencoders (VAEs) fail to learn useful features. We demonstrate this failure in a simple gridworld domain, and then provide a solution in the form of metric learning. However, metric learning requires supervision in the form of a distance function, which is absent in reinforcement learning. To overcome this, we leverage the sequential nature of states in a replay buffer to approximate a distance metric and provide a weak supervision signal, under the assumption that temporally close states are also semantically similar. We modify a VAE with a triplet loss and demonstrate that this approach learns useful features for downstream tasks, without additional supervision, in environments where standard VAEs fail.
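The weak supervision idea in the abstract can be illustrated with a minimal sketch. It assumes states are stored in trajectory order, so the gap between two indices stands in for distance; the function names, the uniform sampling scheme, and the margin value are hypothetical and are not taken from the paper. How the triplet term is combined with the VAE objective is not stated in the abstract, so that part is omitted.

```python
import math
import random


def sample_temporal_triplet(num_states, rng):
    """Pick (anchor, positive, negative) indices from one trajectory.

    Assumption: states are stored in time order, so the positive is
    whichever sampled index lies closer in time to the anchor.
    """
    a, x, y = (rng.randrange(num_states) for _ in range(3))
    if abs(a - x) > abs(a - y):
        x, y = y, x  # ensure x is the temporally closer state
    return a, x, y


def triplet_loss(z_a, z_p, z_n, margin=1.0):
    """Standard hinge triplet loss on Euclidean distances between embeddings."""
    d_ap = math.dist(z_a, z_p)
    d_an = math.dist(z_a, z_n)
    return max(0.0, d_ap - d_an + margin)


# Placeholder embeddings standing in for encoder outputs (hypothetical).
rng = random.Random(0)
embeddings = [(float(t),) for t in range(50)]
a, p, n = sample_temporal_triplet(len(embeddings), rng)
loss = triplet_loss(embeddings[a], embeddings[p], embeddings[n])
```

In practice the embeddings would come from the VAE encoder applied to states drawn from the replay buffer, and the loss would be averaged over a batch of triplets.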

* arXiv admin note: text overlap with arXiv:2202.13341

Data Overlap: A Prerequisite For Disentanglement

Feb 27, 2022

Nathan Michlo, Steven James, Richard Klein

Abstract: Learning disentangled representations with variational autoencoders (VAEs) is often attributed to the regularisation component of the loss. In this work, we highlight the interaction between data and the reconstruction term of the loss as the main contributor to disentanglement in VAEs. We note that standardised benchmark datasets are constructed in a way that is conducive to learning what appear to be disentangled representations. We design an intuitive adversarial dataset that exploits this mechanism to break existing state-of-the-art disentanglement frameworks. Finally, we provide solutions in the form of a modified reconstruction loss, suggesting that VAEs are accidental distance learners.

* 13 pages, 12 figures, 4 tables
