Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:LoMOE: Localized Multi-Object Editing via Multi-Diffusion

Mar 01, 2024

Goirik Chakrabarty, Aditya Chandrasekar, Ramya Hebbalaguppe, Prathosh AP

Figure 1 for LoMOE: Localized Multi-Object Editing via Multi-Diffusion

Figure 2 for LoMOE: Localized Multi-Object Editing via Multi-Diffusion

Figure 3 for LoMOE: Localized Multi-Object Editing via Multi-Diffusion

Figure 4 for LoMOE: Localized Multi-Object Editing via Multi-Diffusion

Share this with someone who'll enjoy it:

Abstract:Recent developments in the field of diffusion models have demonstrated an exceptional capacity to generate high-quality prompt-conditioned image edits. Nevertheless, previous approaches have primarily relied on textual prompts for image editing, which tend to be less effective when making precise edits to specific objects or fine-grained regions within a scene containing single/multiple objects. We introduce a novel framework for zero-shot localized multi-object editing through a multi-diffusion process to overcome this challenge. This framework empowers users to perform various operations on objects within an image, such as adding, replacing, or editing $\textbf{many}$ objects in a complex scene $\textbf{in one pass}$. Our approach leverages foreground masks and corresponding simple text prompts that exert localized influences on the target regions resulting in high-fidelity image editing. A combination of cross-attention and background preservation losses within the latent space ensures that the characteristics of the object being edited are preserved while simultaneously achieving a high-quality, seamless reconstruction of the background with fewer artifacts compared to the current methods. We also curate and release a dataset dedicated to multi-object editing, named $\texttt{LoMOE}$-Bench. Our experiments against existing state-of-the-art methods demonstrate the improved effectiveness of our approach in terms of both image editing quality and inference speed.

* 18 pages

View paper on

Share this with someone who'll enjoy it:

Title:LoMOE: Localized Multi-Object Editing via Multi-Diffusion

Paper and Code