Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:DesignDiffusion: High-Quality Text-to-Design Image Generation with Diffusion Models

Mar 03, 2025

Zhendong Wang, Jianmin Bao, Shuyang Gu, Dong Chen, Wengang Zhou, Houqiang Li

Figure 1 for DesignDiffusion: High-Quality Text-to-Design Image Generation with Diffusion Models

Figure 2 for DesignDiffusion: High-Quality Text-to-Design Image Generation with Diffusion Models

Figure 3 for DesignDiffusion: High-Quality Text-to-Design Image Generation with Diffusion Models

Figure 4 for DesignDiffusion: High-Quality Text-to-Design Image Generation with Diffusion Models

Share this with someone who'll enjoy it:

Abstract:In this paper, we present DesignDiffusion, a simple yet effective framework for the novel task of synthesizing design images from textual descriptions. A primary challenge lies in generating accurate and style-consistent textual and visual content. Existing works in a related task of visual text generation often focus on generating text within given specific regions, which limits the creativity of generation models, resulting in style or color inconsistencies between textual and visual elements if applied to design image generation. To address this issue, we propose an end-to-end, one-stage diffusion-based framework that avoids intricate components like position and layout modeling. Specifically, the proposed framework directly synthesizes textual and visual design elements from user prompts. It utilizes a distinctive character embedding derived from the visual text to enhance the input prompt, along with a character localization loss for enhanced supervision during text generation. Furthermore, we employ a self-play Direct Preference Optimization fine-tuning strategy to improve the quality and accuracy of the synthesized visual text. Extensive experiments demonstrate that DesignDiffusion achieves state-of-the-art performance in design image generation.

* Accepted by CVPR 2025

View paper on

Share this with someone who'll enjoy it:

Title:DesignDiffusion: High-Quality Text-to-Design Image Generation with Diffusion Models

Paper and Code