Oguz Kaan Yuksel

Discovering Multiple and Diverse Directions for Cognitive Image Properties

Feb 23, 2022

Umut Kocasari, Alperen Bag, Oguz Kaan Yuksel, Pinar Yanardag

Abstract: Recent research has shown that it is possible to find interpretable directions in the latent spaces of pre-trained GANs. These directions enable controllable generation and support a variety of semantic editing operations. While previous work has focused on discovering a single direction that performs a desired editing operation such as zoom-in, limited work has been done on the discovery of multiple and diverse directions that can achieve the desired edit. In this work, we propose a novel framework that discovers multiple and diverse directions for a given property of interest. In particular, we focus on the manipulation of cognitive properties such as Memorability, Emotional Valence and Aesthetics. We show with extensive experiments that our method successfully manipulates these properties while producing diverse outputs. Our project page and source code can be found at http://catlab-team.github.io/latentcognitive.
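
As a rough illustration of what editing along several distinct latent directions looks like, the sketch below moves one latent code along k mutually orthogonal unit directions. It is not the paper's framework: the directions here are random vectors orthogonalized with Gram-Schmidt as a stand-in for learned directions, there is no GAN or property scorer, and the names `orthonormal_directions` and `edit` are hypothetical.

```python
import random

def normalize(v):
    """Scale a vector to unit length."""
    norm = sum(x * x for x in v) ** 0.5
    return [x / norm for x in v]

def orthonormal_directions(k, dim, seed=0):
    """Return k orthonormal vectors via Gram-Schmidt on random draws.

    Assumption: a stand-in for directions that a real method would learn.
    Requires k <= dim.
    """
    rng = random.Random(seed)
    basis = []
    while len(basis) < k:
        v = [rng.gauss(0.0, 1.0) for _ in range(dim)]
        # Remove the components along directions already chosen.
        for b in basis:
            proj = sum(x * y for x, y in zip(v, b))
            v = [x - proj * y for x, y in zip(v, b)]
        basis.append(normalize(v))
    return basis

def edit(z, direction, alpha):
    """Shift latent code z by step size alpha along one direction."""
    return [zi + alpha * di for zi, di in zip(z, direction)]

dim, k, alpha = 8, 3, 2.0
rng = random.Random(1)
z = [rng.gauss(0.0, 1.0) for _ in range(dim)]
directions = orthonormal_directions(k, dim)

# One edited latent code per direction; a generator would map each to an image.
edited = [edit(z, d, alpha) for d in directions]
```

Orthogonality is only one simple proxy for diversity; in a real pipeline each edited code would be passed through the pre-trained generator and scored for the property of interest.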

Semantic Perturbations with Normalizing Flows for Improved Generalization

Aug 18, 2021

Oguz Kaan Yuksel, Sebastian U. Stich, Martin Jaggi, Tatjana Chavdarova

Abstract: Data augmentation is a widely adopted technique for avoiding overfitting when training deep neural networks. However, this approach requires domain-specific knowledge and is often limited to a fixed set of hard-coded transformations. Recently, several works proposed using generative models to generate semantically meaningful perturbations for training a classifier. However, because accurate encoding and decoding are critical, these methods, which use architectures that approximate the latent-variable inference, remained limited to pilot studies on small datasets. Exploiting the exactly reversible encoder-decoder structure of normalizing flows, we perform on-manifold perturbations in the latent space to define fully unsupervised data augmentations. We demonstrate that such perturbations match the performance of advanced data augmentation techniques, reaching 96.6% test accuracy for CIFAR-10 using ResNet-18, and outperform existing methods, particularly in low data regimes, yielding 10-25% relative improvement of test accuracy over classical training. We find that our latent adversarial perturbations, adaptive to the classifier throughout its training, are most effective, yielding the first test accuracy improvement results on real-world datasets (CIFAR-10/100) via latent-space perturbations.

* In Proceedings of the IEEE International Conference on Computer Vision
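
The encode, perturb, decode loop described in the abstract above can be sketched with a toy invertible map. This is an illustration under strong assumptions, not the paper's method: `AffineFlow` is a hypothetical elementwise affine transform standing in for a trained normalizing flow, and the perturbation is plain Gaussian noise rather than the adversarial, classifier-adaptive perturbation the authors report as most effective.

```python
import random

class AffineFlow:
    """Toy exactly invertible map z = (x - shift) / scale.

    Assumption: a stand-in for a trained normalizing flow.
    """

    def __init__(self, shift, scale):
        self.shift = shift
        self.scale = scale  # every entry must be non-zero

    def encode(self, x):
        """Map a data point to its latent code."""
        return [(xi - s) / c for xi, s, c in zip(x, self.shift, self.scale)]

    def decode(self, z):
        """Exact inverse of encode."""
        return [zi * c + s for zi, s, c in zip(z, self.shift, self.scale)]

def latent_augment(flow, x, eps, rng):
    """Encode x, add Gaussian noise of scale eps in latent space, decode."""
    z = flow.encode(x)
    z_perturbed = [zi + eps * rng.gauss(0.0, 1.0) for zi in z]
    return flow.decode(z_perturbed)

flow = AffineFlow(shift=[0.5, -1.0, 2.0], scale=[2.0, 0.5, 4.0])
x = [1.0, 0.0, 3.0]

x_aug = latent_augment(flow, x, eps=0.1, rng=random.Random(0))   # augmented sample
x_same = latent_augment(flow, x, eps=0.0, rng=random.Random(0))  # no perturbation
```

Because the map is exactly invertible, a zero perturbation reproduces the input up to floating-point error, which is the property that approximate encoder-decoder architectures lack.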
