Title: Iterative Prompt Relabeling for diffusion model with RLDF

Dec 23, 2023

Jiaxin Ge, Xinyan Chen, Tianjun Zhang, Shanghang Zhang

Abstract: Diffusion models have shown impressive performance in many domains, including image generation, time series prediction, and reinforcement learning, and they outperform traditional GAN- and transformer-based methods. However, their ability to follow natural language instructions (e.g., spatial relationships between objects, generating complex scenes) is still unsatisfactory, and improving it has been an important research area. Prior works adopt reinforcement learning (RL) to adjust the behavior of diffusion models. However, RL methods not only require careful reward design and complex hyperparameter tuning, but also fail to incorporate rich natural language feedback. In this work, we propose iterative prompt relabeling (IP-RLDF), a novel algorithm that aligns images to text through iterative image sampling and prompt relabeling. IP-RLDF first samples a batch of images conditioned on the text, then relabels the text prompts of unmatched text-image pairs using classifier feedback. We conduct thorough experiments on three different models (SDv2, GLIGEN, and SDXL), testing their ability to generate images that follow instructions. IP-RLDF improves performance by up to 15.22% (absolute) on the challenging spatial relation VISOR benchmark, outperforming previous RL methods.
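The sample-then-relabel step described in the abstract can be illustrated with a minimal Python sketch. It is a toy under stated assumptions, not the authors' implementation: the names `relabel_round`, `toy_generate`, and `toy_describe` are hypothetical, images are stood in for by strings, and the "classifier" simply reads back the relation encoded in the string. The abstract does not specify the actual classifier, the fine-tuning loss, or how many rounds are run, so the fine-tuning step appears only as a comment.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass(frozen=True)
class Sample:
    prompt: str    # text that will be paired with the image for fine-tuning
    image: str     # stand-in for real image data (assumption: a plain string)
    matched: bool  # True if the image was judged to match the original prompt


def relabel_round(
    prompts: List[str],
    generate: Callable[[str, int], str],
    describe: Callable[[str], str],
    n_per_prompt: int = 2,
) -> List[Sample]:
    """One round: sample images per prompt, relabel the unmatched pairs."""
    samples = []
    for prompt in prompts:
        for seed in range(n_per_prompt):
            image = generate(prompt, seed)
            observed = describe(image)  # classifier feedback (hypothetical)
            if observed == prompt:
                samples.append(Sample(prompt, image, True))
            else:
                # Unmatched pair: keep the image, replace the prompt with
                # the text that describes what was actually generated.
                samples.append(Sample(observed, image, False))
    # In an iterative scheme, the model would now be fine-tuned on
    # `samples` and the round repeated; that step is omitted here.
    return samples


def toy_generate(prompt: str, seed: int) -> str:
    """Toy generator: odd seeds flip the spatial relation in the prompt."""
    text = prompt
    if seed % 2 == 1:
        if " left of " in text:
            text = text.replace(" left of ", " right of ")
        elif " right of " in text:
            text = text.replace(" right of ", " left of ")
    return "img<" + text + ">"


def toy_describe(image: str) -> str:
    """Toy classifier: reads back the text encoded in the fake image."""
    return image[4:-1]
```

Calling `relabel_round(["a cat to the left of a dog"], toy_generate, toy_describe)` yields one matched pair that keeps its prompt and one unmatched pair whose prompt is rewritten to the flipped relation. A real system would replace the toy functions with a diffusion model and a detector or captioner.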
