Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Chuqiao Li

Unimotion: Unifying 3D Human Motion Synthesis and Understanding

Sep 24, 2024

Chuqiao Li, Julian Chibane, Yannan He, Naama Pearl, Andreas Geiger, Gerard Pons-moll

Figure 1 for Unimotion: Unifying 3D Human Motion Synthesis and Understanding

Figure 2 for Unimotion: Unifying 3D Human Motion Synthesis and Understanding

Figure 3 for Unimotion: Unifying 3D Human Motion Synthesis and Understanding

Figure 4 for Unimotion: Unifying 3D Human Motion Synthesis and Understanding

Abstract:We introduce Unimotion, the first unified multi-task human motion model capable of both flexible motion control and frame-level motion understanding. While existing works control avatar motion with global text conditioning, or with fine-grained per frame scripts, none can do both at once. In addition, none of the existing works can output frame-level text paired with the generated poses. In contrast, Unimotion allows to control motion with global text, or local frame-level text, or both at once, providing more flexible control for users. Importantly, Unimotion is the first model which by design outputs local text paired with the generated poses, allowing users to know what motion happens and when, which is necessary for a wide range of applications. We show Unimotion opens up new applications: 1.) Hierarchical control, allowing users to specify motion at different levels of detail, 2.) Obtaining motion text descriptions for existing MoCap data or YouTube videos 3.) Allowing for editability, generating motion from text, and editing the motion via text edits. Moreover, Unimotion attains state-of-the-art results for the frame-level text-to-motion task on the established HumanML3D dataset. The pre-trained model and code are available available on our project page at https://coral79.github.io/Unimotion/.

Via

Access Paper or Ask Questions

A Continual Deepfake Detection Benchmark: Dataset, Methods, and Essentials

May 14, 2022

Chuqiao Li, Zhiwu Huang, Danda Pani Paudel, Yabin Wang, Mohamad Shahbazi, Xiaopeng Hong, Luc Van Gool

Figure 1 for A Continual Deepfake Detection Benchmark: Dataset, Methods, and Essentials

Figure 2 for A Continual Deepfake Detection Benchmark: Dataset, Methods, and Essentials

Figure 3 for A Continual Deepfake Detection Benchmark: Dataset, Methods, and Essentials

Figure 4 for A Continual Deepfake Detection Benchmark: Dataset, Methods, and Essentials

Abstract:There have been emerging a number of benchmarks and techniques for the detection of deepfakes. However, very few works study the detection of incrementally appearing deepfakes in the real-world scenarios. To simulate the wild scenes, this paper suggests a continual deepfake detection benchmark (CDDB) over a new collection of deepfakes from both known and unknown generative models. The suggested CDDB designs multiple evaluations on the detection over easy, hard, and long sequence of deepfake tasks, with a set of appropriate measures. In addition, we exploit multiple approaches to adapt multiclass incremental learning methods, commonly used in the continual visual recognition, to the continual deepfake detection problem. We evaluate several methods, including the adapted ones, on the proposed CDDB. Within the proposed benchmark, we explore some commonly known essentials of standard continual learning. Our study provides new insights on these essentials in the context of continual deepfake detection. The suggested CDDB is clearly more challenging than the existing benchmarks, which thus offers a suitable evaluation avenue to the future research. Our benchmark dataset and the source code will be made publicly available.

* some typos are corrected

Via

Access Paper or Ask Questions