Title: VIBE: Video Inference for Human Body Pose and Shape Estimation

Dec 11, 2019

Muhammed Kocabas, Nikos Athanasiou, Michael J. Black

Abstract: Human motion is fundamental to understanding behavior. Despite progress on single-image 3D pose and shape estimation, existing video-based state-of-the-art methods fail to produce accurate and natural motion sequences due to a lack of ground-truth 3D motion data for training. To address this problem, we propose Video Inference for Body Pose and Shape Estimation (VIBE), which makes use of an existing large-scale motion capture dataset (AMASS) together with unpaired, in-the-wild, 2D keypoint annotations. Our key novelty is an adversarial learning framework that leverages AMASS to discriminate between real human motions and those produced by our temporal pose and shape regression networks. We define a temporal network architecture and show that adversarial training, at the sequence level, produces kinematically plausible motion sequences without in-the-wild ground-truth 3D labels. We perform extensive experimentation to analyze the importance of motion and demonstrate the effectiveness of VIBE on challenging 3D pose estimation datasets, achieving state-of-the-art performance. Code and pretrained models are available at https://github.com/mkocabas/VIBE.
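To make the sequence-level adversarial idea in the abstract concrete, here is a minimal sketch in plain Python. It is not the authors' implementation: the least-squares form of the losses is an assumption (the abstract does not state the objective), and `smoothness_score` is a hypothetical hand-written stand-in for a learned motion discriminator. The point is only that the discriminator scores a whole motion sequence rather than a single frame, so implausible frame-to-frame changes are penalized.

```python
from typing import Callable, List, Sequence

# One motion sequence: a list of frames, each frame a list of pose parameters.
Motion = Sequence[Sequence[float]]


def mean(values: List[float]) -> float:
    return sum(values) / len(values)


def smoothness_score(motion: Motion) -> float:
    """Hypothetical stand-in for a learned discriminator (assumption).

    Returns a score in (0, 1]: 1 for a perfectly static sequence, lower as
    the mean squared frame-to-frame change grows.
    """
    diffs = [
        sum((a - b) ** 2 for a, b in zip(prev, nxt))
        for prev, nxt in zip(motion, motion[1:])
    ]
    return 1.0 / (1.0 + mean(diffs))


def discriminator_loss(score: Callable[[Motion], float],
                       real: List[Motion],
                       fake: List[Motion]) -> float:
    """Assumed least-squares objective: real sequences -> 1, regressed -> 0."""
    real_term = mean([(score(m) - 1.0) ** 2 for m in real])
    fake_term = mean([score(m) ** 2 for m in fake])
    return real_term + fake_term


def generator_adversarial_loss(score: Callable[[Motion], float],
                               fake: List[Motion]) -> float:
    """Assumed least-squares objective: regressed sequences should score 1."""
    return mean([(score(m) - 1.0) ** 2 for m in fake])


# Toy data: a static "mocap-like" sequence and a jittery regressed one.
smooth = [[0.0, 0.0], [0.0, 0.0], [0.0, 0.0]]
jittery = [[0.0, 0.0], [1.0, 1.0], [0.0, 0.0]]

loss_smooth = generator_adversarial_loss(smoothness_score, [smooth])
loss_jittery = generator_adversarial_loss(smoothness_score, [jittery])
loss_disc = discriminator_loss(smoothness_score, [smooth], [jittery])
```

In this toy setup the jittery sequence receives a larger generator loss than the smooth one, which is the kind of pressure toward plausible motion that the abstract attributes to adversarial training. In a real system the scoring function would be a trained network fed with motion capture sequences such as those in AMASS.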

* Tech Report, 13 pages, 6 figures
