Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Chris Burgess

Unsupervised Object-Based Transition Models for 3D Partially Observable Environments

Mar 08, 2021

Antonia Creswell, Rishabh Kabra, Chris Burgess, Murray Shanahan

Figure 1 for Unsupervised Object-Based Transition Models for 3D Partially Observable Environments

Figure 2 for Unsupervised Object-Based Transition Models for 3D Partially Observable Environments

Figure 3 for Unsupervised Object-Based Transition Models for 3D Partially Observable Environments

Figure 4 for Unsupervised Object-Based Transition Models for 3D Partially Observable Environments

Abstract:We present a slot-wise, object-based transition model that decomposes a scene into objects, aligns them (with respect to a slot-wise object memory) to maintain a consistent order across time, and predicts how those objects evolve over successive frames. The model is trained end-to-end without supervision using losses at the level of the object-structured representation rather than pixels. Thanks to its alignment module, the model deals properly with two issues that are not handled satisfactorily by other transition models, namely object persistence and object identity. We show that the combination of an object-level loss and correct object alignment over time enables the model to outperform a state-of-the-art baseline, and allows it to deal well with object occlusion and re-appearance in partially observable environments.

Via

Access Paper or Ask Questions

AlignNet: Unsupervised Entity Alignment

Jul 21, 2020

Antonia Creswell, Kyriacos Nikiforou, Oriol Vinyals, Andre Saraiva, Rishabh Kabra, Loic Matthey, Chris Burgess, Malcolm Reynolds, Richard Tanburn, Marta Garnelo(+1 more)

Figure 1 for AlignNet: Unsupervised Entity Alignment

Figure 2 for AlignNet: Unsupervised Entity Alignment

Figure 3 for AlignNet: Unsupervised Entity Alignment

Figure 4 for AlignNet: Unsupervised Entity Alignment

Abstract:Recently developed deep learning models are able to learn to segment scenes into component objects without supervision. This opens many new and exciting avenues of research, allowing agents to take objects (or entities) as inputs, rather that pixels. Unfortunately, while these models provide excellent segmentation of a single frame, they do not keep track of how objects segmented at one time-step correspond (or align) to those at a later time-step. The alignment (or correspondence) problem has impeded progress towards using object representations in downstream tasks. In this paper we take steps towards solving the alignment problem, presenting the AlignNet, an unsupervised alignment module.

Via

Access Paper or Ask Questions

Multi-Object Representation Learning with Iterative Variational Inference

Mar 01, 2019

Klaus Greff, Raphaël Lopez Kaufmann, Rishab Kabra, Nick Watters, Chris Burgess, Daniel Zoran, Loic Matthey, Matthew Botvinick, Alexander Lerchner

Figure 1 for Multi-Object Representation Learning with Iterative Variational Inference

Figure 2 for Multi-Object Representation Learning with Iterative Variational Inference

Figure 3 for Multi-Object Representation Learning with Iterative Variational Inference

Figure 4 for Multi-Object Representation Learning with Iterative Variational Inference

Abstract:Human perception is structured around objects which form the basis for our higher-level cognition and impressive systematic generalization abilities. Yet most work on representation learning focuses on feature learning without even considering multiple objects, or treats segmentation as an (often supervised) preprocessing step. Instead, we argue for the importance of learning to segment and represent objects jointly. We demonstrate that, starting from the simple assumption that a scene is composed of multiple entities, it is possible to learn to segment images into interpretable objects with disentangled representations. Our method learns -- without supervision -- to inpaint occluded parts, and extrapolates to scenes with more objects and to unseen objects with novel feature combinations. We also show that, due to the use of iterative variational inference, our system is able to learn multi-modal posteriors for ambiguous inputs and extends naturally to sequences.

Via

Access Paper or Ask Questions