Title: ViCo: Detail-Preserving Visual Condition for Personalized Text-to-Image Generation

Jun 01, 2023

Shaozhe Hao, Kai Han, Shihao Zhao, Kwan-Yee K. Wong

Abstract: Personalized text-to-image generation using diffusion models has recently been proposed and has attracted considerable attention. Given a handful of images containing a novel concept (e.g., a unique toy), we aim to tune the generative model to capture fine visual details of the novel concept and generate photorealistic images following a text condition. We present a plug-in method, named ViCo, for fast and lightweight personalized generation. Specifically, we propose an image attention module to condition the diffusion process on patch-wise visual semantics. We introduce an attention-based object mask that comes at almost no cost from the attention module. In addition, we design a simple regularization based on the intrinsic properties of text-image attention maps to alleviate the common overfitting degradation. Unlike many existing models, our method does not finetune any parameters of the original diffusion model. This allows more flexible and transferable model deployment. With only light parameter training (~6% of the diffusion U-Net), our method achieves comparable or even better performance than all state-of-the-art models, both qualitatively and quantitatively.

* Under review
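
To make the idea of conditioning on patch-wise visual semantics more concrete, here is a minimal NumPy sketch of a cross-attention step in which generated-image patches attend to reference-image patches, with an optional binary mask that hides background patches. It is an illustration under stated assumptions, not the authors' implementation: the function names, the tensor shapes, the random projection weights, and the mean-value threshold used to binarize the attention map are all hypothetical choices made for this example.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def image_cross_attention(latent, reference, w_q, w_k, w_v, ref_mask=None):
    """Hypothetical image attention step (not the paper's code).

    latent:    (n, d) features of the image being generated, one row per patch
    reference: (m, d) features of the reference image, one row per patch
    ref_mask:  optional (m,) boolean array; False hides a reference patch
    Returns the attended features (n, d_v) and the attention weights (n, m).
    """
    q = latent @ w_q
    k = reference @ w_k
    v = reference @ w_v
    scores = q @ k.T / np.sqrt(q.shape[-1])
    if ref_mask is not None:
        # Masked-out reference patches receive a very low score, so their
        # softmax weight becomes zero.
        scores = np.where(ref_mask[None, :], scores, -1e9)
    attn = softmax(scores, axis=-1)
    return attn @ v, attn

def mask_from_attention(attn_map):
    # Assumption: binarize an attention map over reference patches at its
    # mean value. The abstract does not specify the thresholding rule.
    return attn_map > attn_map.mean()

rng = np.random.default_rng(0)
d = 8
latent = rng.normal(size=(16, d))      # 16 patches of the generated image
reference = rng.normal(size=(9, d))    # 9 patches of the reference image
w_q, w_k, w_v = (rng.normal(size=(d, d)) for _ in range(3))

# Unmasked attention: every reference patch can contribute.
out, attn = image_cross_attention(latent, reference, w_q, w_k, w_v)

# Stand-in for an attention map over reference patches that would come from
# the model; random values are used here purely for illustration.
token_attn = softmax(rng.normal(size=9))
mask = mask_from_attention(token_attn)

# Masked attention: only patches inside the mask contribute.
out_masked, attn_masked = image_cross_attention(
    latent, reference, w_q, w_k, w_v, ref_mask=mask
)
print(out.shape, int(mask.sum()), "reference patches kept")
```

In this sketch the mask is obtained by thresholding an attention map that already exists, which mirrors the abstract's remark that the object mask comes at almost no extra cost. Only the projection weights would be trained in such a setup, which is in the spirit of leaving the original diffusion model's parameters untouched.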
