Title: DiffuseST: Unleashing the Capability of the Diffusion Model for Style Transfer

Oct 19, 2024

Ying Hu, Chenyi Zhuang, Pan Gao



Abstract: Style transfer aims to fuse the artistic representation of a style image with the structural information of a content image. Existing methods train specific networks or utilize pre-trained models to learn content and style features. However, they rely solely on either textual or spatial representations, which are inadequate for balancing content and style. In this work, we propose a novel, training-free approach to style transfer that combines textual embeddings with spatial features and separates the injection of content and style. Specifically, we adopt the BLIP-2 encoder to extract the textual representation of the style image. We utilize the DDIM inversion technique to extract intermediate embeddings in the content and style branches as spatial features. Finally, we harness the step-by-step property of diffusion models by separating the injection of content and style in the target branch, which improves the balance between content preservation and style fusion. Experiments demonstrate the effectiveness and robustness of the proposed DiffuseST in achieving balanced and controllable style transfer results, as well as its potential to extend to other tasks.
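To make the two mechanisms named in the abstract more concrete, the following is a minimal NumPy sketch, not the authors' implementation. It shows the standard deterministic DDIM update (eta = 0) run towards higher noise so that every intermediate latent is kept, plus a toy schedule that decides which branch would be injected at each denoising step. The function names, the linear beta schedule, the `content_fraction` parameter, and the ordering (content first, style later) are all assumptions made for illustration; the abstract only states that the two injections are separated across steps.

```python
import numpy as np

def make_alpha_bars(num_steps, beta_start=1e-4, beta_end=2e-2):
    """Cumulative products of (1 - beta_t) for an assumed linear beta schedule."""
    betas = np.linspace(beta_start, beta_end, num_steps)
    return np.cumprod(1.0 - betas)

def ddim_step(x, eps, a_from, a_to):
    """Deterministic DDIM update (eta = 0) between two cumulative-alpha levels."""
    x0_pred = (x - np.sqrt(1.0 - a_from) * eps) / np.sqrt(a_from)
    return np.sqrt(a_to) * x0_pred + np.sqrt(1.0 - a_to) * eps

def ddim_invert(x0, eps_fn, alpha_bars):
    """Walk from the clean latent towards noise, keeping every intermediate latent.

    eps_fn(x, t) stands in for a pre-trained noise predictor (hypothetical here).
    """
    a = np.concatenate([[1.0], alpha_bars])  # a[0] = 1 corresponds to the clean latent
    latents = [x0]
    for t in range(len(alpha_bars)):
        x = latents[-1]
        latents.append(ddim_step(x, eps_fn(x, t), a[t], a[t + 1]))
    return latents

def injection_source(step, num_steps, content_fraction=0.5):
    """Toy schedule: which branch to inject at a given denoising step.

    ASSUMPTION: content features come first and style features later; the
    split point content_fraction is a made-up knob, not a value from the paper.
    """
    return "content" if step < content_fraction * num_steps else "style"

# Toy usage with a constant "noise prediction" so the example is self-contained.
alpha_bars = make_alpha_bars(10)
x0 = np.linspace(-1.0, 1.0, 8)
toy_eps = lambda x, t: np.full_like(x, 0.1)
latents = ddim_invert(x0, toy_eps, alpha_bars)
schedule = [injection_source(s, 10) for s in range(10)]
```

In a real pipeline the stored latents (or features computed from them) of the content and style branches would be the spatial features that are injected into the target branch, with the schedule selecting the source at each step.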

* Accepted to ACMMM Asia 2024. Code is available at https://github.com/I2-Multimedia-Lab/DiffuseST
