Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Guoliang Pang

Human Multi-View Synthesis from a Single-View Model:Transferred Body and Face Representations

Dec 04, 2024

Yu Feng, Shunsi Zhang, Jian Shu, Hanfeng Zhao, Guoliang Pang, Chi Zhang, Hao Wang

Figure 1 for Human Multi-View Synthesis from a Single-View Model:Transferred Body and Face Representations

Figure 2 for Human Multi-View Synthesis from a Single-View Model:Transferred Body and Face Representations

Figure 3 for Human Multi-View Synthesis from a Single-View Model:Transferred Body and Face Representations

Figure 4 for Human Multi-View Synthesis from a Single-View Model:Transferred Body and Face Representations

Abstract:Generating multi-view human images from a single view is a complex and significant challenge. Although recent advancements in multi-view object generation have shown impressive results with diffusion models, novel view synthesis for humans remains constrained by the limited availability of 3D human datasets. Consequently, many existing models struggle to produce realistic human body shapes or capture fine-grained facial details accurately. To address these issues, we propose an innovative framework that leverages transferred body and facial representations for multi-view human synthesis. Specifically, we use a single-view model pretrained on a large-scale human dataset to develop a multi-view body representation, aiming to extend the 2D knowledge of the single-view model to a multi-view diffusion model. Additionally, to enhance the model's detail restoration capability, we integrate transferred multimodal facial features into our trained human diffusion model. Experimental evaluations on benchmark datasets demonstrate that our approach outperforms the current state-of-the-art methods, achieving superior performance in multi-view human synthesis.

Via

Access Paper or Ask Questions

MultiGO: Towards Multi-level Geometry Learning for Monocular 3D Textured Human Reconstruction

Dec 04, 2024

Gangjian Zhang, Nanjie Yao, Shunsi Zhang, Hanfeng Zhao, Guoliang Pang, Jian Shu, Hao Wang

Figure 1 for MultiGO: Towards Multi-level Geometry Learning for Monocular 3D Textured Human Reconstruction

Figure 2 for MultiGO: Towards Multi-level Geometry Learning for Monocular 3D Textured Human Reconstruction

Figure 3 for MultiGO: Towards Multi-level Geometry Learning for Monocular 3D Textured Human Reconstruction

Figure 4 for MultiGO: Towards Multi-level Geometry Learning for Monocular 3D Textured Human Reconstruction

Abstract:This paper investigates the research task of reconstructing the 3D clothed human body from a monocular image. Due to the inherent ambiguity of single-view input, existing approaches leverage pre-trained SMPL(-X) estimation models or generative models to provide auxiliary information for human reconstruction. However, these methods capture only the general human body geometry and overlook specific geometric details, leading to inaccurate skeleton reconstruction, incorrect joint positions, and unclear cloth wrinkles. In response to these issues, we propose a multi-level geometry learning framework. Technically, we design three key components: skeleton-level enhancement, joint-level augmentation, and wrinkle-level refinement modules. Specifically, we effectively integrate the projected 3D Fourier features into a Gaussian reconstruction model, introduce perturbations to improve joint depth estimation during training, and refine the human coarse wrinkles by resembling the de-noising process of diffusion model. Extensive quantitative and qualitative experiments on two out-of-distribution test sets show the superior performance of our approach compared to state-of-the-art (SOTA) methods.

Via

Access Paper or Ask Questions