Title: Image Generation with Multimodal Priors using Denoising Diffusion Probabilistic Models

Jun 10, 2022

Nithin Gopalakrishnan Nair, Wele Gedara Chaminda Bandara, Vishal M Patel

Abstract: Image synthesis under multimodal priors is a useful and challenging task that has received increasing attention in recent years. A major challenge in using generative models for this task is the lack of paired data containing all modalities (i.e., priors) and the corresponding outputs. In recent work, a variational auto-encoder (VAE) model was trained in a weakly supervised manner to address this challenge. Since the generative power of VAEs is usually limited, it is difficult for this method to synthesize images belonging to complex distributions. To this end, we propose a solution based on denoising diffusion probabilistic models to synthesize images under multimodal priors. Based on the fact that the distribution over each time step in the diffusion model is Gaussian, we show that there exists a closed-form expression for generating an image that corresponds to the given modalities. The proposed solution does not require explicit retraining for all modalities and can leverage the outputs of individual modalities to generate realistic images according to different constraints. We conduct studies on two real-world datasets to demonstrate the effectiveness of our approach.
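The abstract does not state the closed-form expression itself. As a rough illustration of why Gaussian per-step distributions make a closed form plausible, the sketch below uses a standard identity: the normalized product of Gaussian densities is again Gaussian, with a precision-weighted mean. This is an assumption chosen for illustration and is not necessarily the expression derived in the paper; the function name `product_of_gaussians` and the example numbers are hypothetical.

```python
from typing import Sequence, Tuple


def product_of_gaussians(
    means: Sequence[float], variances: Sequence[float]
) -> Tuple[float, float]:
    """Return (mean, variance) of the normalized product of 1-D Gaussian densities.

    Illustrative only: each (mean, variance) pair stands in for the per-step
    Gaussian predicted under one modality. The paper's actual rule may differ.
    """
    if len(means) != len(variances) or len(means) == 0:
        raise ValueError("means and variances must be non-empty and equal in length")
    if any(v <= 0 for v in variances):
        raise ValueError("variances must be positive")

    # Precision (inverse variance) of each component.
    precisions = [1.0 / v for v in variances]
    total_precision = sum(precisions)

    # The combined mean is the precision-weighted average of the component means.
    fused_mean = sum(p * m for p, m in zip(precisions, means)) / total_precision
    # The combined variance is the inverse of the summed precisions.
    fused_variance = 1.0 / total_precision
    return fused_mean, fused_variance


# Two hypothetical modalities that disagree about the mean but are equally confident.
fused_mean, fused_var = product_of_gaussians([0.0, 2.0], [1.0, 1.0])
```

With equal variances the combined mean falls midway between the two inputs and the combined variance is halved, which is the sense in which each added modality tightens the constraint on the sample.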
