Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models

Apr 05, 2024

Sangwon Jang, Jaehyeong Jo, Kimin Lee, Sung Ju Hwang

Share this with someone who'll enjoy it:

Abstract:Text-to-image diffusion models have shown remarkable success in generating a personalized subject based on a few reference images. However, current methods struggle with handling multiple subjects simultaneously, often resulting in mixed identities with combined attributes from different subjects. In this work, we present MuDI, a novel framework that enables multi-subject personalization by effectively decoupling identities from multiple subjects. Our main idea is to utilize segmented subjects generated by the Segment Anything Model for both training and inference, as a form of data augmentation for training and initialization for the generation process. Our experiments demonstrate that MuDI can produce high-quality personalized images without identity mixing, even for highly similar subjects as shown in Figure 1. In human evaluation, MuDI shows twice as many successes for personalizing multiple subjects without identity mixing over existing baselines and is preferred over 70% compared to the strongest baseline. More results are available at https://mudi-t2i.github.io/.

* Preprint. Project page: https://mudi-t2i.github.io/

View paper on

Share this with someone who'll enjoy it:

Title:Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models

Paper and Code