Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Lohit Petikam

Eyelid Fold Consistency in Facial Modeling

Oct 17, 2024

Lohit Petikam, Charlie Hewitt, Fatemeh Saleh, Tadas Baltrušaitis

Figure 1 for Eyelid Fold Consistency in Facial Modeling

Figure 2 for Eyelid Fold Consistency in Facial Modeling

Figure 3 for Eyelid Fold Consistency in Facial Modeling

Figure 4 for Eyelid Fold Consistency in Facial Modeling

Abstract:Eyelid shape is integral to identity and likeness in human facial modeling. Human eyelids are diverse in appearance with varied skin fold and epicanthal fold morphology between individuals. Existing parametric face models express eyelid shape variation to an extent, but do not preserve sufficient likeness across a diverse range of individuals. We propose a new definition of eyelid fold consistency and implement geometric processing techniques to model diverse eyelid shapes in a unified topology. Using this method we reprocess data used to train a parametric face model and demonstrate significant improvements in face-related machine learning tasks.

Via

Access Paper or Ask Questions

Look Ma, no markers: holistic performance capture without the hassle

Oct 15, 2024

Charlie Hewitt, Fatemeh Saleh, Sadegh Aliakbarian, Lohit Petikam, Shideh Rezaeifar, Louis Florentin, Zafiirah Hosenie, Thomas J Cashman, Julien Valentin, Darren Cosker(+1 more)

Figure 1 for Look Ma, no markers: holistic performance capture without the hassle

Figure 2 for Look Ma, no markers: holistic performance capture without the hassle

Figure 3 for Look Ma, no markers: holistic performance capture without the hassle

Figure 4 for Look Ma, no markers: holistic performance capture without the hassle

Abstract:We tackle the problem of highly-accurate, holistic performance capture for the face, body and hands simultaneously. Motion-capture technologies used in film and game production typically focus only on face, body or hand capture independently, involve complex and expensive hardware and a high degree of manual intervention from skilled operators. While machine-learning-based approaches exist to overcome these problems, they usually only support a single camera, often operate on a single part of the body, do not produce precise world-space results, and rarely generalize outside specific contexts. In this work, we introduce the first technique for marker-free, high-quality reconstruction of the complete human body, including eyes and tongue, without requiring any calibration, manual intervention or custom hardware. Our approach produces stable world-space results from arbitrary camera rigs as well as supporting varied capture environments and clothing. We achieve this through a hybrid approach that leverages machine learning models trained exclusively on synthetic data and powerful parametric models of human shape and motion. We evaluate our method on a number of body, face and hand reconstruction benchmarks and demonstrate state-of-the-art results that generalize on diverse datasets.

Via

Access Paper or Ask Questions

GeoGen: Geometry-Aware Generative Modeling via Signed Distance Functions

Jun 07, 2024

Salvatore Esposito, Qingshan Xu, Kacper Kania, Charlie Hewitt, Octave Mariotti, Lohit Petikam, Julien Valentin, Arno Onken, Oisin Mac Aodha

Figure 1 for GeoGen: Geometry-Aware Generative Modeling via Signed Distance Functions

Figure 2 for GeoGen: Geometry-Aware Generative Modeling via Signed Distance Functions

Figure 3 for GeoGen: Geometry-Aware Generative Modeling via Signed Distance Functions

Figure 4 for GeoGen: Geometry-Aware Generative Modeling via Signed Distance Functions

Abstract:We introduce a new generative approach for synthesizing 3D geometry and images from single-view collections. Most existing approaches predict volumetric density to render multi-view consistent images. By employing volumetric rendering using neural radiance fields, they inherit a key limitation: the generated geometry is noisy and unconstrained, limiting the quality and utility of the output meshes. To address this issue, we propose GeoGen, a new SDF-based 3D generative model trained in an end-to-end manner. Initially, we reinterpret the volumetric density as a Signed Distance Function (SDF). This allows us to introduce useful priors to generate valid meshes. However, those priors prevent the generative model from learning details, limiting the applicability of the method to real-world scenarios. To alleviate that problem, we make the transformation learnable and constrain the rendered depth map to be consistent with the zero-level set of the SDF. Through the lens of adversarial training, we encourage the network to produce higher fidelity details on the output meshes. For evaluation, we introduce a synthetic dataset of human avatars captured from 360-degree camera angles, to overcome the challenges presented by real-world datasets, which often lack 3D consistency and do not cover all camera angles. Our experiments on multiple datasets show that GeoGen produces visually and quantitatively better geometry than the previous generative models based on neural radiance fields.

* Computer Vision and Pattern Recognition 2024

Via

Access Paper or Ask Questions

Procedural Humans for Computer Vision

Jan 03, 2023

Charlie Hewitt, Tadas Baltrušaitis, Erroll Wood, Lohit Petikam, Louis Florentin, Hanz Cuevas Velasquez

Figure 1 for Procedural Humans for Computer Vision

Figure 2 for Procedural Humans for Computer Vision

Figure 3 for Procedural Humans for Computer Vision

Figure 4 for Procedural Humans for Computer Vision

Abstract:Recent work has shown the benefits of synthetic data for use in computer vision, with applications ranging from autonomous driving to face landmark detection and reconstruction. There are a number of benefits of using synthetic data from privacy preservation and bias elimination to quality and feasibility of annotation. Generating human-centered synthetic data is a particular challenge in terms of realism and domain-gap, though recent work has shown that effective machine learning models can be trained using synthetic face data alone. We show that this can be extended to include the full body by building on the pipeline of Wood et al. to generate synthetic images of humans in their entirety, with ground-truth annotations for computer vision applications. In this report we describe how we construct a parametric model of the face and body, including articulated hands; our rendering pipeline to generate realistic images of humans based on this body model; an approach for training DNNs to regress a dense set of landmarks covering the entire body; and a method for fitting our body model to dense landmarks predicted from multiple views.

Via

Access Paper or Ask Questions