Yunchen Yuan

Are Conditional Latent Diffusion Models Effective for Image Restoration?

Dec 13, 2024

Yunchen Yuan, Junyuan Xiao, Xinjie Li

Abstract: Recent advancements in image restoration (IR) increasingly employ conditional latent diffusion models (CLDMs). While these models have demonstrated notable performance improvements in recent years, this work questions their suitability for IR tasks. CLDMs excel at capturing high-level semantic correlations, making them effective for tasks like text-to-image generation with spatial conditioning. However, in IR, where the goal is to enhance perceptual image quality, these models have difficulty modeling the relationship between degraded images and ground-truth images using a low-level representation. To support our claims, we compare state-of-the-art CLDMs with traditional image restoration models through extensive experiments. Results reveal that despite the scaling advantages of CLDMs, they suffer from high distortion and semantic deviation, especially in cases with minimal degradation, where traditional methods outperform them. Additionally, we perform empirical studies to examine the impact of various CLDM design elements on their restoration performance. We hope these findings inspire a reexamination of current CLDM-based IR solutions, opening up more opportunities in this field.

DiffBody: Human Body Restoration by Imagining with Generative Diffusion Prior

Apr 04, 2024

Yiming Zhang, Zhe Wang, Xinjie Li, Yunchen Yuan, Chengsong Zhang, Xiao Sun, Zhihang Zhong, Jian Wang

Abstract: Human body restoration plays a vital role in various applications related to the human body. Despite recent advances in general image restoration using generative models, their performance in human body restoration remains mediocre, often resulting in foreground and background blending, over-smoothed surface textures, missing accessories, and distorted limbs. To address these challenges, we propose a novel approach that constructs a human body-aware diffusion model leveraging domain-specific knowledge to enhance performance. Specifically, we employ a pretrained body attention module to guide the diffusion model's focus on the foreground, addressing issues caused by blending between the subject and background. We also demonstrate the value of revisiting the language modality of the diffusion model in restoration tasks by seamlessly incorporating text prompts to improve the quality of surface textures and the details of clothing and accessories. Additionally, we introduce a diffusion sampler tailored for fine-grained human body parts, utilizing local semantic information to rectify limb distortions. Lastly, we collect a comprehensive dataset for benchmarking and advancing the field of human body restoration. Extensive experimental validation showcases the superiority of our approach, both quantitatively and qualitatively, over existing methods.
