Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Uncovering the Text Embedding in Text-to-Image Diffusion Models

Apr 01, 2024

Hu Yu, Hao Luo, Fan Wang, Feng Zhao

Figure 1 for Uncovering the Text Embedding in Text-to-Image Diffusion Models

Figure 2 for Uncovering the Text Embedding in Text-to-Image Diffusion Models

Figure 3 for Uncovering the Text Embedding in Text-to-Image Diffusion Models

Figure 4 for Uncovering the Text Embedding in Text-to-Image Diffusion Models

Share this with someone who'll enjoy it:

Abstract:The correspondence between input text and the generated image exhibits opacity, wherein minor textual modifications can induce substantial deviations in the generated image. While, text embedding, as the pivotal intermediary between text and images, remains relatively underexplored. In this paper, we address this research gap by delving into the text embedding space, unleashing its capacity for controllable image editing and explicable semantic direction attributes within a learning-free framework. Specifically, we identify two critical insights regarding the importance of per-word embedding and their contextual correlations within text embedding, providing instructive principles for learning-free image editing. Additionally, we find that text embedding inherently possesses diverse semantic potentials, and further reveal this property through the lens of singular value decomposition (SVD). These uncovered properties offer practical utility for image editing and semantic discovery. More importantly, we expect the in-depth analyses and findings of the text embedding can enhance the understanding of text-to-image diffusion models.

View paper on

Share this with someone who'll enjoy it:

Title:Uncovering the Text Embedding in Text-to-Image Diffusion Models

Paper and Code