Fanzhou Wang

WHAC: World-grounded Humans and Cameras

Mar 19, 2024

Wanqi Yin, Zhongang Cai, Ruisi Wang, Fanzhou Wang, Chen Wei, Haiyi Mei, Weiye Xiao, Zhitao Yang, Qingping Sun, Atsushi Yamashita (+2 more)

Abstract: Estimating human and camera trajectories with accurate scale in the world coordinate system from a monocular video is a highly desirable yet challenging and ill-posed problem. In this study, we aim to recover expressive parametric human models (i.e., SMPL-X) and corresponding camera poses jointly, by leveraging the synergy between three critical players: the world, the human, and the camera. Our approach is founded on two key observations. Firstly, camera-frame SMPL-X estimation methods readily recover absolute human depth. Secondly, human motions inherently provide absolute spatial cues. By integrating these insights, we introduce a novel framework, referred to as WHAC, to facilitate world-grounded expressive human pose and shape estimation (EHPS) alongside camera pose estimation, without relying on traditional optimization techniques. Additionally, we present a new synthetic dataset, WHAC-A-Mole, which includes accurately annotated humans and cameras, and features diverse interactive human motions as well as realistic camera trajectories. Extensive experiments on both standard and newly established benchmarks highlight the superiority and efficacy of our framework. We will make the code and dataset publicly available.

* Homepage: https://wqyin.github.io/projects/WHAC/

Single-shot fringe projection profilometry based on Deep Learning and Computer Graphics

Jan 04, 2021

Fanzhou Wang, Chenxing Wang, Qingze Guan

Abstract: Multiple works have applied deep learning to fringe projection profilometry (FPP) in recent years. However, obtaining a large amount of training data from actual systems remains difficult, and network design and optimization are still worth exploring. In this paper, we introduce computer graphics to build virtual FPP systems that generate the desired datasets conveniently and simply. We first describe in detail how to construct a virtual FPP system, and then analyze the key factors that bring the virtual system closer to reality. With the aim of accurately estimating the depth image from only one fringe image, we also design a new loss function to enhance the quality of both the overall and the detailed information restored. Two representative networks, U-Net and pix2pix, are compared in multiple aspects. Real experiments demonstrate the good accuracy and generalization of the network trained with data from our virtual systems and the designed loss, implying the potential of our method for applications.
