Alperen Bag

Discovering Multiple and Diverse Directions for Cognitive Image Properties

Feb 23, 2022

Umut Kocasari, Alperen Bag, Oguz Kaan Yuksel, Pinar Yanardag

Abstract: Recent research has shown that it is possible to find interpretable directions in the latent spaces of pre-trained GANs. These directions enable controllable generation and support a variety of semantic editing operations. While previous work has focused on discovering a single direction that performs a desired editing operation such as zoom-in, limited work has been done on the discovery of multiple and diverse directions that can achieve the desired edit. In this work, we propose a novel framework that discovers multiple and diverse directions for a given property of interest. In particular, we focus on the manipulation of cognitive properties such as Memorability, Emotional Valence and Aesthetics. We show with extensive experiments that our method successfully manipulates these properties while producing diverse outputs. Our project page and source code can be found at http://catlab-team.github.io/latentcognitive.
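As a rough illustration of what editing along several latent directions means, the sketch below shifts one latent code along a few unit-length directions and checks how similar those directions are to each other. It is not the authors' method: the directions here are random placeholders (a real framework would learn them), and the names `shift`, `cosine`, `dim` and `alpha` are assumptions introduced only for this example.

```python
import math
import random


def normalize(v):
    """Scale a vector to unit length."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def shift(z, direction, alpha):
    """Move latent code z by step size alpha along a direction."""
    return [zi + alpha * di for zi, di in zip(z, direction)]


def cosine(u, v):
    """Cosine similarity; values near 0 suggest the directions differ."""
    return sum(a * b for a, b in zip(normalize(u), normalize(v)))


rng = random.Random(0)
dim = 8  # hypothetical latent size; real GAN latent spaces are much larger

# A latent code that a pre-trained generator would map to an image.
z = [rng.gauss(0, 1) for _ in range(dim)]

# Placeholder directions (random here, learned in an actual framework).
directions = [normalize([rng.gauss(0, 1) for _ in range(dim)]) for _ in range(3)]

# One edited latent code per direction, all using the same step size.
edited = [shift(z, d, alpha=2.0) for d in directions]

# Pairwise similarity between directions, as a crude diversity check.
similarities = [
    cosine(directions[i], directions[j])
    for i in range(len(directions))
    for j in range(i + 1, len(directions))
]
```

Each entry of `edited` would be passed to the generator to render one variant of the edit; lower values in `similarities` indicate that the directions point in more distinct ways.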

Exploring Latent Dimensions of Crowd-sourced Creativity

Dec 13, 2021

Umut Kocasari, Alperen Bag, Efehan Atici, Pinar Yanardag

Abstract: Recently, the discovery of interpretable directions in the latent spaces of pre-trained GANs has become a popular topic. While existing works mostly consider directions for semantic image manipulations, we focus on an abstract property: creativity. Can we manipulate an image to be more or less creative? We build our work on the largest AI-based creativity platform, Artbreeder, where users can generate images using pre-trained GAN models. We explore the latent dimensions of images generated on this platform and present a novel framework for manipulating images to make them more creative. Our code and dataset are available at http://github.com/catlab-team/latentcreative.

* 5th Workshop on Machine Learning for Creativity and Design (NeurIPS 2021), Sydney, Australia
