Weining Wang

MM-LDM: Multi-Modal Latent Diffusion Model for Sounding Video Generation

Oct 02, 2024
Via arXiv

COMUNI: Decomposing Common and Unique Video Signals for Diffusion-based Video Generation

Oct 02, 2024
Via arXiv

GLOBER: Coherent Non-autoregressive Video Generation via GLOBal Guided Video DecodER

Sep 23, 2023
Via arXiv

VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset

Apr 17, 2023
Via arXiv

Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation

Mar 29, 2023
Via arXiv

MOSO: Decomposing MOtion, Scene and Object for Video Prediction

Mar 16, 2023
Via arXiv

Learning Disentangled Representation for One-shot Progressive Face Swapping

Mar 24, 2022
Via arXiv

Exploiting Spatial-Temporal Semantic Consistency for Video Scene Parsing

Sep 06, 2021
Via arXiv

OPT: Omni-Perception Pre-Trainer for Cross-Modal Understanding and Generation

Jul 06, 2021
Via arXiv

Face Sketch Synthesis via Semantic-Driven Generative Adversarial Network

Jun 29, 2021
Via arXiv