Igor Santesteban

FRESA: Feedforward Reconstruction of Personalized Skinned Avatars from Few Images

Mar 24, 2025

Rong Wang, Fabian Prada, Ziyan Wang, Zhongshi Jiang, Chengxiang Yin, Junxuan Li, Shunsuke Saito, Igor Santesteban, Javier Romero, Rohan Joshi (+3 more)

Abstract: We present a novel method for reconstructing personalized 3D human avatars with realistic animation from only a few images. Due to the large variations in body shapes, poses, and clothing types, existing methods mostly require hours of per-subject optimization during inference, which limits their practical applications. In contrast, we learn a universal prior from over a thousand clothed humans to achieve instant feedforward generation and zero-shot generalization. Specifically, instead of rigging the avatar with shared skinning weights, we jointly infer personalized avatar shape, skinning weights, and pose-dependent deformations, which effectively improves overall geometric fidelity and reduces deformation artifacts. Moreover, to normalize pose variations and resolve coupled ambiguity between canonical shapes and skinning weights, we design a 3D canonicalization process to produce pixel-aligned initial conditions, which helps to reconstruct fine-grained geometric details. We then propose a multi-frame feature aggregation to robustly reduce artifacts introduced in canonicalization and fuse a plausible avatar preserving person-specific identities. Finally, we train the model in an end-to-end framework on a large-scale capture dataset, which contains diverse human subjects paired with high-quality 3D scans. Extensive experiments show that our method generates more authentic reconstruction and animation than state-of-the-art methods, and can be directly generalized to inputs from casually taken phone photos. The project page and code are available at https://github.com/rongakowang/FRESA.

* Published in CVPR 2025

Relightable Full-Body Gaussian Codec Avatars

Jan 24, 2025

Shaofei Wang, Tomas Simon, Igor Santesteban, Timur Bagautdinov, Junxuan Li, Vasu Agrawal, Fabian Prada, Shoou-I Yu, Pace Nalbone, Matt Gramlich (+8 more)

Abstract: We propose Relightable Full-Body Gaussian Codec Avatars, a new approach for modeling relightable full-body avatars with fine-grained details including face and hands. The unique challenge for relighting full-body avatars lies in the large deformations caused by body articulation and the resulting impact on appearance caused by light transport. Changes in body pose can dramatically change the orientation of body surfaces with respect to lights, resulting in both local appearance changes due to changes in local light transport functions, as well as non-local changes due to occlusion between body parts. To address this, we decompose the light transport into local and non-local effects. Local appearance changes are modeled using learnable zonal harmonics for diffuse radiance transfer. Unlike spherical harmonics, zonal harmonics are highly efficient to rotate under articulation. This allows us to learn diffuse radiance transfer in a local coordinate frame, which disentangles the local radiance transfer from the articulation of the body. To account for non-local appearance changes, we introduce a shadow network that predicts shadows given precomputed incoming irradiance on a base mesh. This facilitates the learning of non-local shadowing between the body parts. Finally, we use a deferred shading approach to model specular radiance transfer and better capture reflections and highlights such as eye glints. We demonstrate that our approach successfully models both the local and non-local light transport required for relightable full-body avatars, with a superior generalization ability under novel illumination conditions and unseen poses.

* 14 pages, 9 figures. Project page: https://neuralbodies.github.io/RFGCA
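
The abstract's remark that zonal harmonics are cheap to rotate can be illustrated with a small sketch. This is not the paper's implementation: the function name `zonal_harmonics`, the coefficient layout, and the use of a single lobe axis are assumptions made for illustration. A zonal harmonic depends only on the angle between the query direction and the lobe axis, so rotating the function amounts to rotating that one axis vector.

```python
import numpy as np
from numpy.polynomial import legendre


def zonal_harmonics(directions, axis, coeffs):
    """Evaluate a zonal-harmonic expansion around a single lobe axis.

    directions: (N, 3) unit vectors at which to evaluate.
    axis:       (3,) lobe axis (normalized internally).
    coeffs:     per-degree coefficients c_0..c_L (hypothetical layout).
    """
    axis = axis / np.linalg.norm(axis)
    cos_theta = np.clip(directions @ axis, -1.0, 1.0)
    degrees = np.arange(len(coeffs))
    # Standard normalization sqrt((2l + 1) / (4 pi)) for the degree-l zonal term.
    scaled = np.asarray(coeffs, dtype=float) * np.sqrt((2 * degrees + 1) / (4 * np.pi))
    # legval computes sum_l scaled[l] * P_l(cos_theta).
    return legendre.legval(cos_theta, scaled)


# Rotating the lobe only requires rotating its axis; coefficients are unchanged.
rotation = np.array([[0.0, -1.0, 0.0],
                     [1.0, 0.0, 0.0],
                     [0.0, 0.0, 1.0]])  # 90 degrees about z
dirs = np.array([[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]])
lobe_axis = np.array([1.0, 0.0, 0.0])
lobe_coeffs = [0.5, 1.0, 0.25]

original = zonal_harmonics(dirs, lobe_axis, lobe_coeffs)
rotated = zonal_harmonics(dirs @ rotation.T, rotation @ lobe_axis, lobe_coeffs)
```

Here `rotated` matches `original` because both the query directions and the axis were rotated together. A general spherical-harmonic expansion would instead need a per-band rotation of its coefficients.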

SqueezeMe: Efficient Gaussian Avatars for VR

Dec 19, 2024

Shunsuke Saito, Stanislav Pidhorskyi, Igor Santesteban, Forrest Iandola, Divam Gupta, Anuj Pahuja, Nemanja Bartolovic, Frank Yu, Emanuel Garbin, Tomas Simon

Abstract: Gaussian Splatting has enabled real-time 3D human avatars with unprecedented levels of visual quality. While previous methods require a desktop GPU for real-time inference of a single avatar, we aim to squeeze multiple Gaussian avatars onto a portable virtual reality headset with real-time drivable inference. We begin by training a previous work, Animatable Gaussians, on a high-quality dataset captured with 512 cameras. The Gaussians are animated by controlling a base set of Gaussians with linear blend skinning (LBS) motion and then further adjusting the Gaussians with a neural network decoder to correct their appearance. When deploying the model on a Meta Quest 3 VR headset, we find two major computational bottlenecks: the decoder and the rendering. To accelerate the decoder, we train the Gaussians in UV-space instead of pixel-space, and we distill the decoder to a single neural network layer. Further, we discover that neighborhoods of Gaussians can share a single corrective from the decoder, which provides an additional speedup. To accelerate the rendering, we develop a custom pipeline in Vulkan that runs on the mobile GPU. Putting it all together, we run 3 Gaussian avatars concurrently at 72 FPS on a VR headset. Demo videos are at https://forresti.github.io/squeezeme.

* Initial version
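
For readers unfamiliar with the linear blend skinning (LBS) step mentioned in the abstract, the following is a minimal NumPy sketch of standard LBS applied to point positions such as Gaussian centers. It is a generic textbook formulation, not code from the paper; the function name and array layout are assumptions, and it does not handle Gaussian orientations, scales, or the decoder correctives.

```python
import numpy as np


def linear_blend_skinning(points, weights, transforms):
    """Move points by a weighted blend of per-joint rigid transforms.

    points:     (N, 3) rest-pose positions (e.g. Gaussian centers).
    weights:    (N, J) skinning weights; each row is assumed to sum to 1.
    transforms: (J, 4, 4) homogeneous joint transforms for the target pose.
    """
    n = points.shape[0]
    homogeneous = np.concatenate([points, np.ones((n, 1))], axis=1)  # (N, 4)
    # Blend the joint matrices per point: (N, J) x (J, 4, 4) -> (N, 4, 4).
    blended = np.einsum("nj,jab->nab", weights, transforms)
    # Apply each blended matrix to its own point.
    skinned = np.einsum("nab,nb->na", blended, homogeneous)
    return skinned[:, :3]


# Two joints: one stays put, one translates by 2 units along y.
joint_transforms = np.stack([np.eye(4), np.eye(4)])
joint_transforms[1, :3, 3] = [0.0, 2.0, 0.0]
rest_points = np.array([[1.0, 0.0, 0.0]])
half_weights = np.array([[0.5, 0.5]])
posed_points = linear_blend_skinning(rest_points, half_weights, joint_transforms)
```

With equal weights on the two joints, the point moves halfway along the second joint's translation.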

SNUG: Self-Supervised Neural Dynamic Garments

Apr 05, 2022

Igor Santesteban, Miguel A. Otaduy, Dan Casas

Abstract: We present a self-supervised method to learn dynamic 3D deformations of garments worn by parametric human bodies. State-of-the-art data-driven approaches to model 3D garment deformations are trained using supervised strategies that require large datasets, usually obtained by expensive physics-based simulation methods or professional multi-camera capture setups. In contrast, we propose a new training scheme that removes the need for ground-truth samples, enabling self-supervised training of dynamic 3D garment deformations. Our key contribution is to realize that physics-based deformation models, traditionally solved on a frame-by-frame basis by implicit integrators, can be recast as an optimization problem. We leverage this optimization-based scheme to formulate a set of physics-based loss terms that can be used to train neural networks without precomputing ground-truth data. This allows us to learn models for interactive garments, including dynamic deformations and fine wrinkles, with a two-orders-of-magnitude speedup in training time compared to state-of-the-art supervised methods.

* CVPR 2022 (Oral). Project website: http://mslab.es/projects/SNUG/
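
As background for the claim that implicit integration can be recast as an optimization problem, the standard variational form of a backward Euler step is sketched below. The symbols are assumptions for illustration and are not taken from the abstract: $x_t$ and $v_t$ are positions and velocities at step $t$, $M$ is the mass matrix, $\Delta t$ the time step, $f_{\mathrm{ext}}$ the external forces, and $E(x)$ the elastic potential energy. The paper's actual loss terms may differ in detail.

```latex
\begin{aligned}
\hat{x} &= x_t + \Delta t\, v_t + \Delta t^{2} M^{-1} f_{\mathrm{ext}}, \\
x_{t+1} &= \arg\min_{x}\; \frac{1}{2\,\Delta t^{2}}\,(x - \hat{x})^{\top} M\,(x - \hat{x}) + E(x).
\end{aligned}
```

Setting the gradient of the objective to zero gives $M\,(x_{t+1} - x_t - \Delta t\, v_t)/\Delta t^{2} = f_{\mathrm{ext}} - \nabla E(x_{t+1})$, which is the backward Euler update. Since the objective is a scalar function of the predicted positions, it can serve directly as a training loss with no ground-truth simulation data.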

Self-Supervised Collision Handling via Generative 3D Garment Models for Virtual Try-On

May 13, 2021

Igor Santesteban, Nils Thuerey, Miguel A. Otaduy, Dan Casas

Abstract: We propose a new generative model for 3D garment deformations that enables us to learn, for the first time, a data-driven method for virtual try-on that effectively addresses garment-body collisions. In contrast to existing methods that require an undesirable postprocessing step to fix garment-body interpenetrations at test time, our approach directly outputs 3D garment configurations that do not collide with the underlying body. Key to our success is a new canonical space for garments that removes pose-and-shape deformations already captured by a new diffused human body model, which extrapolates body surface properties such as skinning weights and blendshapes to any 3D point. We leverage this representation to train a generative model with a novel self-supervised collision term that learns to reliably solve garment-body interpenetrations. We extensively evaluate and compare our results with recently proposed data-driven methods, and show that our method is the first to successfully address garment-body contact in unseen body shapes and motions, without compromising realism and detail.

* Accepted to CVPR 2021. Project website: http://mslab.es/projects/SelfSupervisedGarmentCollisions
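
To make the idea of a self-supervised collision term concrete, here is a minimal sketch of one common form of penetration penalty. It is an assumption-laden illustration, not the loss from the paper: the function name, the nearest-vertex approximation of signed distance, and the safety margin `eps` are all hypothetical. The penalty needs only the predicted garment and the body surface, so no ground-truth garment is required.

```python
import numpy as np


def collision_penalty(garment_vertices, body_vertices, body_normals, eps=0.005):
    """Penalize garment vertices that lie inside (or too close to) the body.

    garment_vertices: (G, 3) predicted garment positions.
    body_vertices:    (B, 3) body surface samples.
    body_normals:     (B, 3) outward unit normals at those samples.
    eps:              hypothetical safety margin, in the mesh's length unit.
    """
    # Brute-force nearest body sample for every garment vertex.
    diff = garment_vertices[:, None, :] - body_vertices[None, :, :]  # (G, B, 3)
    nearest = np.argmin(np.linalg.norm(diff, axis=2), axis=1)        # (G,)
    # Approximate signed distance: offset projected onto the outward normal.
    offset = garment_vertices - body_vertices[nearest]
    signed_distance = np.sum(offset * body_normals[nearest], axis=1)
    # Zero when the vertex is at least eps outside, linear in depth otherwise.
    return float(np.sum(np.maximum(eps - signed_distance, 0.0)))


# Toy body: one surface sample at the origin whose normal points along +z.
toy_body = np.array([[0.0, 0.0, 0.0]])
toy_normals = np.array([[0.0, 0.0, 1.0]])
outside = np.array([[0.0, 0.0, 0.1]])     # well outside the margin
inside = np.array([[0.0, 0.0, -0.02]])    # 2 cm below the surface

penalty_outside = collision_penalty(outside, toy_body, toy_normals)
penalty_inside = collision_penalty(inside, toy_body, toy_normals)
```

The brute-force nearest-neighbor search is quadratic in the number of vertices; a spatial index or a precomputed signed distance field would be the usual replacement at scale.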

Fully Convolutional Graph Neural Networks for Parametric Virtual Try-On

Sep 09, 2020

Raquel Vidaurre, Igor Santesteban, Elena Garces, Dan Casas

Abstract: We present a learning-based approach for virtual try-on applications based on a fully convolutional graph neural network. In contrast to existing data-driven models, which are trained for a specific garment or mesh topology, our fully convolutional model can cope with a large family of garments, represented as parametric predefined 2D panels with arbitrary mesh topology, including long dresses, shirts, and tight tops. Under the hood, our novel geometric deep learning approach learns to drape 3D garments by decoupling the three different sources of deformations that condition the fit of clothing: garment type, target body shape, and material. Specifically, we first learn a regressor that predicts the 3D drape of the input parametric garment when worn by a mean body shape. Then, after a mesh topology optimization step where we generate a sufficient level of detail for the input garment type, we further deform the mesh to reproduce deformations caused by the target body shape. Finally, we predict fine-scale details such as wrinkles that depend mostly on the garment material. We qualitatively and quantitatively demonstrate that our fully convolutional approach outperforms existing methods in terms of generalization capabilities and memory requirements, and therefore it opens the door to more general learning-based models for virtual try-on applications.

* Accepted to ACM SIGGRAPH / Eurographics Symposium on Computer Animation, 2020. Project website: http://mslab.es/projects/FullyConvolutionalGraphVirtualTryOn

SoftSMPL: Data-driven Modeling of Nonlinear Soft-tissue Dynamics for Parametric Humans

Apr 01, 2020

Igor Santesteban, Elena Garces, Miguel A. Otaduy, Dan Casas

Abstract: We present SoftSMPL, a learning-based method to model realistic soft-tissue dynamics as a function of body shape and motion. Datasets for learning such a task are scarce and expensive to generate, which makes training models prone to overfitting. At the core of our method are three key contributions that enable us to model highly realistic dynamics and achieve better generalization capabilities than state-of-the-art methods, while training on the same data. First, a novel motion descriptor that disentangles the standard pose representation by removing subject-specific features; second, a neural-network-based recurrent regressor that generalizes to unseen shapes and motions; and third, a highly efficient nonlinear deformation subspace capable of representing soft-tissue deformations of arbitrary shapes. We demonstrate qualitative and quantitative improvements over existing methods and, additionally, we show the robustness of our method on a variety of motion capture databases.

* Accepted at Eurographics 2020. Project website: http://dancasas.github.io/projects/SoftSMPL

Learning-Based Animation of Clothing for Virtual Try-On

Mar 17, 2019

Igor Santesteban, Miguel A. Otaduy, Dan Casas

Abstract: This paper presents a learning-based clothing animation method for highly efficient virtual try-on simulation. Given a garment, we preprocess a rich database of physically based dressed character simulations, for multiple body shapes and animations. Then, using this database, we train a learning-based model of cloth drape and wrinkles, as a function of body shape and dynamics. We propose a model that separates global garment fit, due to body shape, from local garment wrinkles, due to both pose dynamics and body shape. We use a recurrent neural network to regress garment wrinkles, and we achieve highly plausible nonlinear effects, in contrast to the blending artifacts suffered by previous methods. At runtime, dynamic virtual try-on animations are produced in just a few milliseconds for garments with thousands of triangles. We present a qualitative and quantitative analysis of the results.

* Eurographics 2019
