Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:ProSpect: Expanded Conditioning for the Personalization of Attribute-aware Image Generation

May 25, 2023

Yuxin Zhang, Weiming Dong, Fan Tang, Nisha Huang, Haibin Huang, Chongyang Ma, Tong-Yee Lee, Oliver Deussen, Changsheng Xu

Figure 1 for ProSpect: Expanded Conditioning for the Personalization of Attribute-aware Image Generation

Figure 2 for ProSpect: Expanded Conditioning for the Personalization of Attribute-aware Image Generation

Figure 3 for ProSpect: Expanded Conditioning for the Personalization of Attribute-aware Image Generation

Figure 4 for ProSpect: Expanded Conditioning for the Personalization of Attribute-aware Image Generation

Share this with someone who'll enjoy it:

Abstract:Personalizing generative models offers a way to guide image generation with user-provided references. Current personalization methods can invert an object or concept into the textual conditioning space and compose new natural sentences for text-to-image diffusion models. However, representing and editing specific visual attributes like material, style, layout, etc. remains a challenge, leading to a lack of disentanglement and editability. To address this, we propose a novel approach that leverages the step-by-step generation process of diffusion models, which generate images from low- to high-frequency information, providing a new perspective on representing, generating, and editing images. We develop Prompt Spectrum Space P*, an expanded textual conditioning space, and a new image representation method called ProSpect. ProSpect represents an image as a collection of inverted textual token embeddings encoded from per-stage prompts, where each prompt corresponds to a specific generation stage (i.e., a group of consecutive steps) of the diffusion model. Experimental results demonstrate that P* and ProSpect offer stronger disentanglement and controllability compared to existing methods. We apply ProSpect in various personalized attribute-aware image generation applications, such as image/text-guided material/style/layout transfer/editing, achieving previously unattainable results with a single image input without fine-tuning the diffusion models.

View paper on

Share this with someone who'll enjoy it:

Title:ProSpect: Expanded Conditioning for the Personalization of Attribute-aware Image Generation

Paper and Code