Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Luisa Polania

Learning Disentangled Prompts for Compositional Image Synthesis

Jun 01, 2023

Kihyuk Sohn, Albert Shaw, Yuan Hao, Han Zhang, Luisa Polania, Huiwen Chang, Lu Jiang, Irfan Essa

Figure 1 for Learning Disentangled Prompts for Compositional Image Synthesis

Figure 2 for Learning Disentangled Prompts for Compositional Image Synthesis

Figure 3 for Learning Disentangled Prompts for Compositional Image Synthesis

Figure 4 for Learning Disentangled Prompts for Compositional Image Synthesis

Abstract:We study domain-adaptive image synthesis, the problem of teaching pretrained image generative models a new style or concept from as few as one image to synthesize novel images, to better understand the compositional image synthesis. We present a framework that leverages a pretrained class-conditional generation model and visual prompt tuning. Specifically, we propose a novel source class distilled visual prompt that learns disentangled prompts of semantic (e.g., class) and domain (e.g., style) from a few images. Learned domain prompt is then used to synthesize images of any classes in the style of target domain. We conduct studies on various target domains with the number of images ranging from one to a few to many, and show qualitative results which show the compositional generalization of our method. Moreover, we show that our method can help improve zero-shot domain adaptation classification accuracy.

* tech report

Via

Access Paper or Ask Questions

Visual Prompt Tuning for Generative Transfer Learning

Oct 03, 2022

Kihyuk Sohn, Yuan Hao, José Lezama, Luisa Polania, Huiwen Chang, Han Zhang, Irfan Essa, Lu Jiang

Figure 1 for Visual Prompt Tuning for Generative Transfer Learning

Figure 2 for Visual Prompt Tuning for Generative Transfer Learning

Figure 3 for Visual Prompt Tuning for Generative Transfer Learning

Figure 4 for Visual Prompt Tuning for Generative Transfer Learning

Abstract:Transferring knowledge from an image synthesis model trained on a large dataset is a promising direction for learning generative image models from various domains efficiently. While previous works have studied GAN models, we present a recipe for learning vision transformers by generative knowledge transfer. We base our framework on state-of-the-art generative vision transformers that represent an image as a sequence of visual tokens to the autoregressive or non-autoregressive transformers. To adapt to a new domain, we employ prompt tuning, which prepends learnable tokens called prompt to the image token sequence, and introduce a new prompt design for our task. We study on a variety of visual domains, including visual task adaptation benchmark~\cite{zhai2019large}, with varying amount of training images, and show effectiveness of knowledge transfer and a significantly better image generation quality over existing works.

* technical report

Via

Access Paper or Ask Questions

Ordinal Regression using Noisy Pairwise Comparisons for Body Mass Index Range Estimation

Nov 08, 2018

Luisa Polania, Dongning Wang, Glenn Fung

Figure 1 for Ordinal Regression using Noisy Pairwise Comparisons for Body Mass Index Range Estimation

Figure 2 for Ordinal Regression using Noisy Pairwise Comparisons for Body Mass Index Range Estimation

Figure 3 for Ordinal Regression using Noisy Pairwise Comparisons for Body Mass Index Range Estimation

Figure 4 for Ordinal Regression using Noisy Pairwise Comparisons for Body Mass Index Range Estimation

Abstract:Ordinal regression aims to classify instances into ordinal categories. In this paper, body mass index (BMI) category estimation from facial images is cast as an ordinal regression problem. In particular, noisy binary search algorithms based on pairwise comparisons are employed to exploit the ordinal relationship among BMI categories. Comparisons are performed with Siamese architectures, one of which uses the Bradley-Terry model probabilities as target. The Bradley-Terry model is an approach to describe probabilities of the possible outcomes when elements of a set are repeatedly compared with one another in pairs. Experimental results show that our approach outperforms classification and regression-based methods at estimating BMI categories.

* Paper accepted for publication at the 2019 IEEE Winter Conference on Applications of Computer Vision (WACV 2019)

Via

Access Paper or Ask Questions