Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:GeneMAN: Generalizable Single-Image 3D Human Reconstruction from Multi-Source Human Data

Nov 27, 2024

Wentao Wang, Hang Ye, Fangzhou Hong, Xue Yang, Jianfu Zhang, Yizhou Wang, Ziwei Liu, Liang Pan

Figure 1 for GeneMAN: Generalizable Single-Image 3D Human Reconstruction from Multi-Source Human Data

Figure 2 for GeneMAN: Generalizable Single-Image 3D Human Reconstruction from Multi-Source Human Data

Figure 3 for GeneMAN: Generalizable Single-Image 3D Human Reconstruction from Multi-Source Human Data

Figure 4 for GeneMAN: Generalizable Single-Image 3D Human Reconstruction from Multi-Source Human Data

Share this with someone who'll enjoy it:

Abstract:Given a single in-the-wild human photo, it remains a challenging task to reconstruct a high-fidelity 3D human model. Existing methods face difficulties including a) the varying body proportions captured by in-the-wild human images; b) diverse personal belongings within the shot; and c) ambiguities in human postures and inconsistency in human textures. In addition, the scarcity of high-quality human data intensifies the challenge. To address these problems, we propose a Generalizable image-to-3D huMAN reconstruction framework, dubbed GeneMAN, building upon a comprehensive multi-source collection of high-quality human data, including 3D scans, multi-view videos, single photos, and our generated synthetic human data. GeneMAN encompasses three key modules. 1) Without relying on parametric human models (e.g., SMPL), GeneMAN first trains a human-specific text-to-image diffusion model and a view-conditioned diffusion model, serving as GeneMAN 2D human prior and 3D human prior for reconstruction, respectively. 2) With the help of the pretrained human prior models, the Geometry Initialization-&-Sculpting pipeline is leveraged to recover high-quality 3D human geometry given a single image. 3) To achieve high-fidelity 3D human textures, GeneMAN employs the Multi-Space Texture Refinement pipeline, consecutively refining textures in the latent and the pixel spaces. Extensive experimental results demonstrate that GeneMAN could generate high-quality 3D human models from a single image input, outperforming prior state-of-the-art methods. Notably, GeneMAN could reveal much better generalizability in dealing with in-the-wild images, often yielding high-quality 3D human models in natural poses with common items, regardless of the body proportions in the input images.

* Project page: https://roooooz.github.io/GeneMAN/

View paper on

Share this with someone who'll enjoy it:

Title:GeneMAN: Generalizable Single-Image 3D Human Reconstruction from Multi-Source Human Data

Paper and Code