Eyal Gomel

Diffusion-Based Attention Warping for Consistent 3D Scene Editing

Dec 10, 2024

Eyal Gomel, Lior Wolf

Abstract: We present a novel method for 3D scene editing using diffusion models, designed to ensure view consistency and realism across perspectives. Our approach uses attention features extracted from a single reference image to define the intended edits. These features are warped to other views by aligning them with scene geometry derived from Gaussian splatting depth estimates. Injecting the warped features into the other viewpoints propagates the edits coherently, achieving high fidelity and spatial alignment in 3D space. Extensive evaluations demonstrate the effectiveness of our method in generating versatile edits of 3D scenes, significantly advancing the capabilities of scene manipulation compared to existing methods. Project page: https://attention-warp.github.io
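
The depth-based warping step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a simple pinhole camera, a known rigid transform between the reference and target views, nearest-neighbor splatting, and a z-buffer for occlusions. The function names `warp_pixel` and `warp_features` and all parameters are hypothetical, and a plain 2D grid of values stands in for the attention features.

```python
def warp_pixel(u, v, depth, K, R, t):
    """Reproject a reference-view pixel with known depth into a target view.

    K = (fx, fy, cx, cy) are assumed pinhole intrinsics shared by both views.
    R (3x3 nested list) and t (length-3) map reference-camera coordinates to
    target-camera coordinates. Returns (u_t, v_t, z_t), or None if the point
    falls behind the target camera.
    """
    fx, fy, cx, cy = K
    # Back-project the pixel to a 3D point in the reference camera frame.
    p = ((u - cx) / fx * depth, (v - cy) / fy * depth, depth)
    # Apply the rigid transform into the target camera frame.
    q = [sum(R[i][j] * p[j] for j in range(3)) + t[i] for i in range(3)]
    if q[2] <= 0:
        return None
    # Project onto the target image plane.
    return (fx * q[0] / q[2] + cx, fy * q[1] / q[2] + cy, q[2])


def warp_features(features, depth_map, K, R, t):
    """Forward-warp an H x W feature grid into the target view.

    Uses nearest-neighbor splatting; when several source pixels land on the
    same target pixel, the one closest to the target camera wins (z-buffer).
    Target pixels that receive nothing stay None.
    """
    h, w = len(features), len(features[0])
    warped = [[None] * w for _ in range(h)]
    zbuf = [[float("inf")] * w for _ in range(h)]
    for v in range(h):
        for u in range(w):
            hit = warp_pixel(u, v, depth_map[v][u], K, R, t)
            if hit is None:
                continue
            ut, vt, zt = int(round(hit[0])), int(round(hit[1])), hit[2]
            if 0 <= ut < w and 0 <= vt < h and zt < zbuf[vt][ut]:
                zbuf[vt][ut] = zt
                warped[vt][ut] = features[v][u]
    return warped


IDENTITY = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
```

With the identity rotation and zero translation, the grid maps onto itself. With unit intrinsics, unit depth, and a translation of one unit along x, every feature moves one pixel to the right and the vacated column stays empty. In a real pipeline the empty cells would need filling or masking before the warped features are injected.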

Box-based Refinement for Weakly Supervised and Unsupervised Localization Tasks

Sep 07, 2023

Eyal Gomel, Tal Shaharabany, Lior Wolf

Abstract: It has been established that training a box-based detector network can enhance the localization performance of weakly supervised and unsupervised methods. We extend this understanding by demonstrating that these detectors can also be used to improve the original network, paving the way for further advancements. To accomplish this, we train the detectors on top of the network output instead of the image data and apply suitable loss backpropagation. Our findings reveal a significant improvement in phrase grounding for the "what is where by looking" task, as well as in various methods of unsupervised object discovery. Our code is available at https://github.com/eyalgomel/box-based-refinement.
