Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Matthew Painter

Linear Disentangled Representations and Unsupervised Action Estimation

Aug 18, 2020

Matthew Painter, Jonathon Hare, Adam Prugel-Bennett

Figure 1 for Linear Disentangled Representations and Unsupervised Action Estimation

Figure 2 for Linear Disentangled Representations and Unsupervised Action Estimation

Figure 3 for Linear Disentangled Representations and Unsupervised Action Estimation

Figure 4 for Linear Disentangled Representations and Unsupervised Action Estimation

Abstract:Disentangled representation learning has seen a surge in interest over recent times, generally focusing on new models to optimise one of many disparate disentanglement metrics. It was only with Symmetry Based Disentangled Representation Learning that a robust mathematical framework was introduced to define precisely what is meant by a "linear disentangled representation". This framework determines that such representations would depend on a particular decomposition of the symmetry group acting on the data, showing that actions would manifest through irreducible group representations acting on independent representational subspaces. ForwardVAE subsequently proposed the first model to induce and demonstrate a linear disentangled representation in a VAE model. In this work we empirically show that linear disentangled representations are not present in standard VAE models and that they instead require altering the loss landscape to induce them. We proceed to show that such representations are a desirable property with regard to classical disentanglement metrics. Finally we propose a method to induce irreducible representations which forgoes the need for labelled action sequences, as was required by prior work. We explore a number of properties of this method, including the ability to learn from action sequences without knowledge of intermediate states.

Via

Access Paper or Ask Questions

Understanding and Enhancing Mixed Sample Data Augmentation

Feb 27, 2020

Ethan Harris, Antonia Marcu, Matthew Painter, Mahesan Niranjan, Adam Prügel-Bennett, Jonathon Hare

Figure 1 for Understanding and Enhancing Mixed Sample Data Augmentation

Figure 2 for Understanding and Enhancing Mixed Sample Data Augmentation

Figure 3 for Understanding and Enhancing Mixed Sample Data Augmentation

Figure 4 for Understanding and Enhancing Mixed Sample Data Augmentation

Abstract:Mixed Sample Data Augmentation (MSDA) has received increasing attention in recent years, with many successful variants such as MixUp and CutMix. Following insight on the efficacy of CutMix in particular, we propose FMix, an MSDA that uses binary masks obtained by applying a threshold to low frequency images sampled from Fourier space. FMix improves performance over MixUp and CutMix for a number of state-of-the-art models across a range of data sets and problem settings. We go on to analyse MixUp, CutMix, and FMix from an information theoretic perspective, characterising learned models in terms of how they progressively compress the input with depth. Ultimately, our analyses allow us to decouple two complementary properties of augmentations, and present a unified framework for reasoning about MSDA. Code for all experiments is available at https://github.com/ecs-vlc/FMix.

* 15 pages, code available at https://github.com/ecs-vlc/FMix

Via

Access Paper or Ask Questions

Torchbearer: A Model Fitting Library for PyTorch

Sep 10, 2018

Ethan Harris, Matthew Painter, Jonathon Hare

Abstract:We introduce torchbearer, a model fitting library for pytorch aimed at researchers working on deep learning or differentiable programming. The torchbearer library provides a high level metric and callback API that can be used for a wide range of applications. We also include a series of built in callbacks that can be used for: model persistence, learning rate decay, logging, data visualization and more. The extensive documentation includes an example library for deep learning and dynamic programming problems and can be found at http://torchbearer.readthedocs.io. The code is licensed under the MIT License and available at https://github.com/ecs-vlc/torchbearer.

* 5 pages

Via

Access Paper or Ask Questions