Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Riccardo Gherardi

Effective Real Image Editing with Accelerated Iterative Diffusion Inversion

Sep 10, 2023

Zhihong Pan, Riccardo Gherardi, Xiufeng Xie, Stephen Huang

Abstract:Despite all recent progress, it is still challenging to edit and manipulate natural images with modern generative models. When using Generative Adversarial Network (GAN), one major hurdle is in the inversion process mapping a real image to its corresponding noise vector in the latent space, since its necessary to be able to reconstruct an image to edit its contents. Likewise for Denoising Diffusion Implicit Models (DDIM), the linearization assumption in each inversion step makes the whole deterministic inversion process unreliable. Existing approaches that have tackled the problem of inversion stability often incur in significant trade-offs in computational efficiency. In this work we propose an Accelerated Iterative Diffusion Inversion method, dubbed AIDI, that significantly improves reconstruction accuracy with minimal additional overhead in space and time complexity. By using a novel blended guidance technique, we show that effective results can be obtained on a large range of image editing tasks without large classifier-free guidance in inversion. Furthermore, when compared with other diffusion inversion based works, our proposed process is shown to be more robust for fast image editing in the 10 and 20 diffusion steps' regimes.

* Accepted to ICCV 2023 (Oral)

Via

Access Paper or Ask Questions

HollowNeRF: Pruning Hashgrid-Based NeRFs with Trainable Collision Mitigation

Aug 19, 2023

Xiufeng Xie, Riccardo Gherardi, Zhihong Pan, Stephen Huang

Abstract:Neural radiance fields (NeRF) have garnered significant attention, with recent works such as Instant-NGP accelerating NeRF training and evaluation through a combination of hashgrid-based positional encoding and neural networks. However, effectively leveraging the spatial sparsity of 3D scenes remains a challenge. To cull away unnecessary regions of the feature grid, existing solutions rely on prior knowledge of object shape or periodically estimate object shape during training by repeated model evaluations, which are costly and wasteful. To address this issue, we propose HollowNeRF, a novel compression solution for hashgrid-based NeRF which automatically sparsifies the feature grid during the training phase. Instead of directly compressing dense features, HollowNeRF trains a coarse 3D saliency mask that guides efficient feature pruning, and employs an alternating direction method of multipliers (ADMM) pruner to sparsify the 3D saliency mask during training. By exploiting the sparsity in the 3D scene to redistribute hash collisions, HollowNeRF improves rendering quality while using a fraction of the parameters of comparable state-of-the-art solutions, leading to a better cost-accuracy trade-off. Our method delivers comparable rendering quality to Instant-NGP, while utilizing just 31% of the parameters. In addition, our solution can achieve a PSNR accuracy gain of up to 1dB using only 56% of the parameters.

* Accepted to ICCV 2023

Via

Access Paper or Ask Questions

Hierarchical structure-and-motion recovery from uncalibrated images

Jun 01, 2015

Roberto Toldo, Riccardo Gherardi, Michela Farenzena, Andrea Fusiello

Figure 1 for Hierarchical structure-and-motion recovery from uncalibrated images

Figure 2 for Hierarchical structure-and-motion recovery from uncalibrated images

Figure 3 for Hierarchical structure-and-motion recovery from uncalibrated images

Figure 4 for Hierarchical structure-and-motion recovery from uncalibrated images

Abstract:This paper addresses the structure-and-motion problem, that requires to find camera motion and 3D struc- ture from point matches. A new pipeline, dubbed Samantha, is presented, that departs from the prevailing sequential paradigm and embraces instead a hierarchical approach. This method has several advantages, like a provably lower computational complexity, which is necessary to achieve true scalability, and better error containment, leading to more stability and less drift. Moreover, a practical autocalibration procedure allows to process images without ancillary information. Experiments with real data assess the accuracy and the computational efficiency of the method.

* Computer Vision and Image Understanding, Volume 140, November 2015, Pages 127-143
* Accepted for publication in CVIU

Via

Access Paper or Ask Questions