Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation

Feb 27, 2023

Yuxiang Wei, Yabo Zhang, Zhilong Ji, Jinfeng Bai, Lei Zhang, Wangmeng Zuo

Figure 1 for ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation

Figure 2 for ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation

Figure 3 for ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation

Figure 4 for ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation

Share this with someone who'll enjoy it:

Abstract:Despite unprecedented ability in imaginary creation, large text-to-image models are further expected to express customized concepts. Existing works generally learn such concepts in an optimization-based manner, yet bringing excessive computation or memory burden. In this paper, we instead propose a learning-based encoder for fast and accurate concept customization, which consists of global and local mapping networks. In specific, the global mapping network separately projects the hierarchical features of a given image into multiple ``new'' words in the textual word embedding space, i.e., one primary word for well-editable concept and other auxiliary words to exclude irrelevant disturbances (e.g., background). In the meantime, a local mapping network injects the encoded patch features into cross attention layers to provide omitted details, without sacrificing the editability of primary concepts. We compare our method with prior optimization-based approaches on a variety of user-defined concepts, and demonstrate that our method enables more high-fidelity inversion and robust editability with a significantly faster encoding process. Our code will be publicly available at https://github.com/csyxwei/ELITE.

View paper on

Share this with someone who'll enjoy it:

Title:ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation

Paper and Code