Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Amir Barda

Instant3dit: Multiview Inpainting for Fast Editing of 3D Objects

Nov 30, 2024

Amir Barda, Matheus Gadelha, Vladimir G. Kim, Noam Aigerman, Amit H. Bermano, Thibault Groueix

Abstract:We propose a generative technique to edit 3D shapes, represented as meshes, NeRFs, or Gaussian Splats, in approximately 3 seconds, without the need for running an SDS type of optimization. Our key insight is to cast 3D editing as a multiview image inpainting problem, as this representation is generic and can be mapped back to any 3D representation using the bank of available Large Reconstruction Models. We explore different fine-tuning strategies to obtain both multiview generation and inpainting capabilities within the same diffusion model. In particular, the design of the inpainting mask is an important factor of training an inpainting model, and we propose several masking strategies to mimic the types of edits a user would perform on a 3D shape. Our approach takes 3D generative editing from hours to seconds and produces higher-quality results compared to previous works.

* project page: https://amirbarda.github.io/Instant3dit.github.io/

Via

Access Paper or Ask Questions

MeshCNN Fundamentals: Geometric Learning through a Reconstructable Representation

May 27, 2021

Amir Barda, Yotam Erel, Amit H. Bermano

Figure 1 for MeshCNN Fundamentals: Geometric Learning through a Reconstructable Representation

Figure 2 for MeshCNN Fundamentals: Geometric Learning through a Reconstructable Representation

Figure 3 for MeshCNN Fundamentals: Geometric Learning through a Reconstructable Representation

Figure 4 for MeshCNN Fundamentals: Geometric Learning through a Reconstructable Representation

Abstract:Mesh-based learning is one of the popular approaches nowadays to learn shapes. The most established backbone in this field is MeshCNN. In this paper, we propose infusing MeshCNN with geometric reasoning to achieve higher quality learning. Through careful analysis of the way geometry is represented through-out the network, we submit that this representation should be rigid motion invariant, and should allow reconstructing the original geometry. Accordingly, we introduce the first and second fundamental forms as an edge-centric, rotation and translation invariant, reconstructable representation. In addition, we update the originally proposed pooling scheme to be more geometrically driven. We validate our analysis through experimentation, and present consistent improvement upon the MeshCNN baseline, as well as other more elaborate state-of-the-art architectures. Furthermore, we demonstrate this fundamental forms-based representation opens the door to accessible generative machine learning over meshes.

Via

Access Paper or Ask Questions