Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Violeta Menéndez González

ZeST-NeRF: Using temporal aggregation for Zero-Shot Temporal NeRFs

Nov 30, 2023

Violeta Menéndez González, Andrew Gilbert, Graeme Phillipson, Stephen Jolly, Simon Hadfield

Figure 1 for ZeST-NeRF: Using temporal aggregation for Zero-Shot Temporal NeRFs

Figure 2 for ZeST-NeRF: Using temporal aggregation for Zero-Shot Temporal NeRFs

Figure 3 for ZeST-NeRF: Using temporal aggregation for Zero-Shot Temporal NeRFs

Figure 4 for ZeST-NeRF: Using temporal aggregation for Zero-Shot Temporal NeRFs

Abstract:In the field of media production, video editing techniques play a pivotal role. Recent approaches have had great success at performing novel view image synthesis of static scenes. But adding temporal information adds an extra layer of complexity. Previous models have focused on implicitly representing static and dynamic scenes using NeRF. These models achieve impressive results but are costly at training and inference time. They overfit an MLP to describe the scene implicitly as a function of position. This paper proposes ZeST-NeRF, a new approach that can produce temporal NeRFs for new scenes without retraining. We can accurately reconstruct novel views using multi-view synthesis techniques and scene flow-field estimation, trained only with unrelated scenes. We demonstrate how existing state-of-the-art approaches from a range of fields cannot adequately solve this new task and demonstrate the efficacy of our solution. The resulting network improves quantitatively by 15% and produces significantly better visual results.

* VUA BMVC 2023

Via

Access Paper or Ask Questions

SVS: Adversarial refinement for sparse novel view synthesis

Nov 14, 2022

Violeta Menéndez González, Andrew Gilbert, Graeme Phillipson, Stephen Jolly, Simon Hadfield

Abstract:This paper proposes Sparse View Synthesis. This is a view synthesis problem where the number of reference views is limited, and the baseline between target and reference view is significant. Under these conditions, current radiance field methods fail catastrophically due to inescapable artifacts such 3D floating blobs, blurring and structural duplication, whenever the number of reference views is limited, or the target view diverges significantly from the reference views. Advances in network architecture and loss regularisation are unable to satisfactorily remove these artifacts. The occlusions within the scene ensure that the true contents of these regions is simply not available to the model. In this work, we instead focus on hallucinating plausible scene contents within such regions. To this end we unify radiance field models with adversarial learning and perceptual losses. The resulting system provides up to 60% improvement in perceptual accuracy compared to current state-of-the-art radiance field models on this problem.

* BMVC 2022

Via

Access Paper or Ask Questions

SaiNet: Stereo aware inpainting behind objects with generative networks

May 14, 2022

Violeta Menéndez González, Andrew Gilbert, Graeme Phillipson, Stephen Jolly, Simon Hadfield

Figure 1 for SaiNet: Stereo aware inpainting behind objects with generative networks

Figure 2 for SaiNet: Stereo aware inpainting behind objects with generative networks

Figure 3 for SaiNet: Stereo aware inpainting behind objects with generative networks

Figure 4 for SaiNet: Stereo aware inpainting behind objects with generative networks

Abstract:In this work, we present an end-to-end network for stereo-consistent image inpainting with the objective of inpainting large missing regions behind objects. The proposed model consists of an edge-guided UNet-like network using Partial Convolutions. We enforce multi-view stereo consistency by introducing a disparity loss. More importantly, we develop a training scheme where the model is learned from realistic stereo masks representing object occlusions, instead of the more common random masks. The technique is trained in a supervised way. Our evaluation shows competitive results compared to previous state-of-the-art techniques.

* Presented at AI4CC workshop at CVPR

Via

Access Paper or Ask Questions