Title: Text-Guided Synthesis of Artistic Images with Retrieval-Augmented Diffusion Models

Jul 26, 2022

Robin Rombach, Andreas Blattmann, Björn Ommer

Abstract: Novel architectures have recently improved generative image synthesis, leading to excellent visual quality in various tasks. Of particular note is the field of "AI-Art", which has seen unprecedented growth with the emergence of powerful multimodal models such as CLIP. By combining language and image synthesis models, so-called "prompt engineering" has become established, in which carefully selected and composed sentences are used to achieve a certain visual style in the synthesized image. In this note, we present an alternative approach based on retrieval-augmented diffusion models (RDMs). In RDMs, a set of nearest neighbors is retrieved from an external database during training for each training instance, and the diffusion model is conditioned on these informative samples. During inference (sampling), we replace the retrieval database with a more specialized database that contains, for example, only images of a particular visual style. This provides a novel way to prompt a generally trained model after training and thereby specify a particular visual style. As shown by our experiments, this approach is superior to specifying the visual style within the text prompt. We open-source code and model weights at https://github.com/CompVis/latent-diffusion.
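The database swap described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the function names, the 2-D "embeddings", the use of cosine similarity, and the idea of querying with an embedded prompt are all assumptions made for illustration. It only shows that the same retrieval routine returns different conditioning neighbors once the database is exchanged for a style-specific one.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def retrieve_neighbors(query, database, k):
    """Return the k (name, embedding) entries most similar to the query."""
    ranked = sorted(database, key=lambda item: cosine(query, item[1]), reverse=True)
    return ranked[:k]

# Hypothetical toy embeddings; a real system would use learned ones (e.g. from CLIP).
train_db = [
    ("photo_cat", [1.0, 0.0]),
    ("photo_dog", [0.9, 0.1]),
    ("photo_car", [0.0, 1.0]),
]
# Specialized database containing only images of one visual style.
style_db = [
    ("painting_cat", [0.95, 0.05]),
    ("painting_boat", [0.1, 0.9]),
]

query = [1.0, 0.05]  # assumed stand-in for an embedded query

# Training-time retrieval uses the general database ...
train_neighbors = retrieve_neighbors(query, train_db, k=2)
# ... while sampling-time retrieval swaps in the style-specific one.
style_neighbors = retrieve_neighbors(query, style_db, k=1)

print([name for name, _ in train_neighbors])
print([name for name, _ in style_neighbors])
```

In an actual RDM, the retrieved neighbors would be passed to the diffusion model as conditioning; that step is omitted here because the abstract does not specify how it is done.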

* 4 pages
