Abstract:Physical sketches are created by learning programs that control a drawing robot. A differentiable rasteriser is used to optimise sets of drawing strokes to match an input image, with deep networks providing an encoding over which a loss can be computed. The optimised drawing primitives can then be translated into G-code that directs a robot to draw the image with instruments such as pens and pencils on a physical support medium.
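As a minimal sketch (not the authors' implementation) of the final translation step, the following converts optimised polyline strokes into G-code for a pen plotter; the coordinate units, feed rate, and pen-lift commands are illustrative assumptions.

```python
def strokes_to_gcode(strokes, feed_rate=1500, pen_up="M5", pen_down="M3 S1000"):
    """Each stroke is a list of (x, y) points in machine coordinates (mm)."""
    lines = ["G21", "G90", pen_up]  # millimetres, absolute positioning, pen raised
    for stroke in strokes:
        (x0, y0), *rest = stroke
        lines.append(f"G0 X{x0:.2f} Y{y0:.2f}")  # rapid move to the stroke start
        lines.append(pen_down)                   # lower the pen
        for x, y in rest:
            lines.append(f"G1 X{x:.2f} Y{y:.2f} F{feed_rate}")  # draw each segment
        lines.append(pen_up)                     # raise the pen between strokes
    return "\n".join(lines)
```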
Abstract:We present an investigation into how representational losses affect the drawings produced by artificial agents playing a communication game. Building upon recent advances, we show that a combination of powerful pretrained encoder networks, with appropriate inductive biases, can lead to agents that draw recognisable sketches whilst still communicating well. Further, we begin to develop an approach for automatically analysing the semantic content conveyed by a sketch, and demonstrate that current approaches to inducing perceptual biases make a notion of objectness a key feature, despite the agents being trained in a self-supervised manner.
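A representational loss of the kind described above can be sketched as follows, assuming PyTorch and some pretrained `encoder`; the idea is to compare encoder features of the rendered sketch and the target image rather than raw pixels, so gradients shape the drawing at a semantic level.

```python
import torch
import torch.nn.functional as F

def representational_loss(encoder, sketch, target):
    with torch.no_grad():
        target_feat = encoder(target)  # fixed features of the target image
    sketch_feat = encoder(sketch)      # gradients flow back into the sketch
    return F.mse_loss(sketch_feat, target_feat)
```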
Abstract:Evidence that visual communication preceded written language and provided a basis for it dates back to prehistory, in forms such as cave and rock paintings depicting traces of our distant ancestors. Emergent communication research has sought to explore how agents can learn to communicate in order to collaboratively solve tasks. Existing research has focused on language, with a learned communication channel transmitting sequences of discrete tokens between the agents. In this work, we explore a visual communication channel between agents that are allowed to draw with simple strokes. Our agents are parameterised by deep neural networks, and the drawing procedure is differentiable, allowing for end-to-end training. In the framework of a referential communication game, we demonstrate that agents can not only successfully learn to communicate by drawing, but, with appropriate inductive biases, can do so in a fashion that humans can interpret. We hope to encourage future research to consider visual communication as a more flexible and directly interpretable alternative for training collaborative agents.
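One step of such a referential game can be sketched as below, assuming PyTorch; `sender`, `rasterise`, and `receiver` are placeholders for the learned networks and the differentiable drawing procedure, and the target image is taken to be candidate 0.

```python
import torch
import torch.nn.functional as F

def drawing_game_step(sender, rasterise, receiver, target, distractors):
    strokes = sender(target)                       # stroke parameters from the target image
    sketch = rasterise(strokes)                    # differentiable render to pixels
    candidates = torch.cat([target, distractors])  # target first, then distractors
    sketch_emb = receiver(sketch)                  # (1, dim)
    cand_emb = receiver(candidates)                # (n_candidates, dim)
    scores = sketch_emb @ cand_emb.t()             # similarity to each candidate
    loss = F.cross_entropy(scores, torch.zeros(1, dtype=torch.long))
    return loss  # backpropagation reaches the sender through the rasteriser
```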
Abstract:We present a bottom-up differentiable relaxation of the process of drawing points, lines and curves into a pixel raster. Our approach arises from the observation that rasterising a pixel in an image given parameters of a primitive can be reformulated in terms of the primitive's distance transform, and then relaxed to allow the primitive's parameters to be learned. This relaxation allows end-to-end differentiable programs and deep networks to be learned and optimised, and it provides several building blocks that allow control over how a compositional drawing process is modelled. We emphasise the bottom-up nature of our proposed approach, which allows for drawing operations to be composed in ways that can mimic the physical reality of drawing rather than being tied to, for example, approaches in modern computer graphics. With the proposed approach we demonstrate how sketches can be generated by directly optimising against photographs and how auto-encoders can be built to transform rasterised handwritten digits into vectors without supervision. Extensive experimental results highlight the power of this approach under different modelling assumptions for drawing tasks.
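The core idea can be sketched for a single line segment, assuming PyTorch: each pixel's squared distance to the segment is computed and relaxed with exp(-d²/σ²), so the raster is smooth in the segment's endpoints and they can be optimised by gradient descent. The relaxation function and σ² value here are illustrative choices.

```python
import torch

def raster_line(p0, p1, height, width, sigma2=1.0):
    ys, xs = torch.meshgrid(torch.arange(height, dtype=torch.float32),
                            torch.arange(width, dtype=torch.float32),
                            indexing="ij")
    grid = torch.stack([xs, ys], dim=-1)     # (H, W, 2) pixel coordinates
    d = p1 - p0                              # segment direction
    t = ((grid - p0) * d).sum(-1) / d.dot(d).clamp(min=1e-8)
    t = t.clamp(0.0, 1.0)                    # project each pixel onto the segment
    closest = p0 + t.unsqueeze(-1) * d       # closest point on the segment
    dist2 = ((grid - closest) ** 2).sum(-1)  # squared distance transform
    return torch.exp(-dist2 / sigma2)        # soft, differentiable raster

# The endpoints can now receive gradients from any loss on the rendered image
p0 = torch.tensor([2.0, 2.0], requires_grad=True)
p1 = torch.tensor([28.0, 28.0], requires_grad=True)
img = raster_line(p0, p1, 32, 32)            # (32, 32) soft raster
```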
Abstract:The emergence of communication systems between agents which learn to play referential signalling games with realistic images has attracted a lot of attention recently. The majority of work has focused on using fixed, pretrained image feature extraction networks, which potentially bias the information the agents learn to communicate. In this work, we consider a signalling game setting in which a `sender' agent must communicate the information about an image to a `receiver' who must select the correct image from many distractors. We investigate the effect of the feature extractor's weights and of the task being solved on the visual semantics learned by the models. We first demonstrate to what extent the use of pretrained feature extraction networks inductively biases the visual semantics conveyed by the emergent communication channel, and we quantify the visual semantics that are induced. We then go on to explore ways in which inductive biases can be introduced to encourage the emergence of semantically meaningful communication without the need for any form of supervised pretraining of the visual feature extractor. We apply various augmentations to the input images and add auxiliary tasks to the game with the aim of inducing visual representations which capture conceptual properties of images. Through our experiments, we demonstrate that communication systems which capture visual semantics can be learned in a completely self-supervised manner by playing the right types of game. Our work bridges a gap between emergent communication research and self-supervised feature learning.
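One simple way to realise the augmentation idea, in the spirit of self-supervised recipes, is to give the sender and receiver different augmented views of the same target image, so the channel must carry view-invariant, conceptual content; the torchvision transforms and their parameters below are illustrative assumptions.

```python
import torchvision.transforms as T

augment = T.Compose([
    T.RandomResizedCrop(128, scale=(0.5, 1.0)),
    T.RandomHorizontalFlip(),
    T.ColorJitter(0.4, 0.4, 0.4, 0.1),
])

def make_views(image):
    # The sender describes one view; the receiver must pick out the other
    # among distractors, forcing the message to be invariant to the augmentations.
    return augment(image), augment(image)
```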
Abstract:Recent work suggests that changing the architecture of a Convolutional Neural Network (CNN) by introducing a bottleneck in the second layer can yield changes in the learned function. To understand this relationship fully requires a way of quantitatively comparing trained networks. The fields of electrophysiology and psychophysics have developed a wealth of methods for characterising visual systems which permit such comparisons. Inspired by these methods, we propose an approach to obtaining spatial and colour tuning curves for convolutional neurons, which can be used to classify cells in terms of their spatial and colour opponency. We perform these classifications for a range of CNNs with different depths and bottleneck widths. Our key finding is that networks with a bottleneck show a strong functional organisation: almost all cells in the bottleneck layer become both spatially and colour opponent, while cells in the layer following the bottleneck become non-opponent. The colour tuning data can further be used to form a rich understanding of how colour is encoded by a network. As a concrete demonstration, we show that shallower networks without a bottleneck learn a complex non-linear colour system, whereas deeper networks with tight bottlenecks learn a simple channel-opponent code in the bottleneck layer. We further develop a method of obtaining a hue sensitivity curve for a trained CNN which enables high-level insights that complement the low-level findings from the colour tuning data. We go on to train a series of networks under different conditions to ascertain the robustness of the discussed results. Ultimately, our methods and findings coalesce with prior art, strengthening our ability to interpret trained CNNs and furthering our understanding of the connection between architecture and learned representation. Code for all experiments is available at https://github.com/ecs-vlc/opponency.
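In the spirit of these electrophysiology-inspired probes, an orientation tuning curve for a single convolutional channel can be sketched as follows, assuming PyTorch; `layer_response` is a hypothetical hook that returns the feature map at the probed layer, and the grating parameters are illustrative.

```python
import math
import torch

def grating(theta, freq=0.2, size=64):
    ys, xs = torch.meshgrid(torch.arange(size, dtype=torch.float32),
                            torch.arange(size, dtype=torch.float32),
                            indexing="ij")
    phase = 2 * math.pi * freq * (xs * math.cos(theta) + ys * math.sin(theta))
    return torch.sin(phase).expand(1, 3, size, size)  # greyscale grating over RGB

def orientation_tuning(layer_response, channel, n_angles=36):
    """layer_response(x) -> feature map (1, C, H, W) at the probed layer."""
    curve = []
    for i in range(n_angles):
        theta = math.pi * i / n_angles
        with torch.no_grad():
            fmap = layer_response(grating(theta))
        curve.append(fmap[0, channel].mean().item())  # mean activation per angle
    return curve
```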
Abstract:There has been an increasing interest in the area of emergent communication between agents which learn to play referential signalling games with realistic images. In this work, we consider the signalling game setting of Havrylov and Titov and investigate the effect of the feature extractor's weights and of the task being solved on the visual semantics learned or captured by the models. We apply various augmentations to the input images and add auxiliary tasks to the game with the aim of inducing visual representations which capture conceptual properties of images. Through our set of experiments, we demonstrate that communication systems which capture visual semantics can be learned in a completely self-supervised manner by playing the right types of game.
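In the Havrylov and Titov style of game the channel carries discrete tokens, which can be kept differentiable with the straight-through Gumbel-softmax estimator; the vocabulary size and temperature below are assumptions for illustration.

```python
import torch
import torch.nn.functional as F

def sample_token(logits, tau=1.0):
    # One-hot sample from a categorical distribution over the vocabulary,
    # with straight-through gradients so the sender can be trained end-to-end.
    return F.gumbel_softmax(logits, tau=tau, hard=True)

logits = torch.randn(4, 100)   # batch of 4 senders, vocabulary of 100 tokens
tokens = sample_token(logits)  # (4, 100) one-hot messages
```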
Abstract:Colour vision has long fascinated scientists, who have sought to understand both the physiology of the mechanisms of colour vision and the psychophysics of colour perception. We consider representations of colour in anatomically constrained convolutional deep neural networks. Following ideas from neuroscience, we classify cells in early layers into groups relating to their spectral and spatial functionality. We show the emergence of single and double opponent cells in our networks and characterise how the distribution of these cells changes under the constraint of a retinal bottleneck. Our experiments not only open up a new understanding of how deep networks process spatial and colour information, but also provide new tools to help understand the black box of deep learning. The code for all experiments is available at https://github.com/ecs-vlc/opponency.
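A deliberately simplified colour-opponency test, a hypothetical reduction of the tuning-curve classification above: probe a channel with uniform single-colour stimuli and call it colour opponent when responses to different colour channels have opposite signs.

```python
import torch

def colour_responses(layer_response, channel, size=64):
    resp = []
    for c in range(3):                     # uniform R, G and B probes
        img = torch.zeros(1, 3, size, size)
        img[0, c] = 1.0                    # single-channel stimulus
        with torch.no_grad():
            fmap = layer_response(img)
        resp.append(fmap[0, channel].mean().item())
    return resp

def is_colour_opponent(resp, eps=1e-3):
    # Opposite-signed responses to different colour channels indicate opponency
    return any(r > eps for r in resp) and any(r < -eps for r in resp)
```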