Picture for Yaohui Wang

Yaohui Wang

CausalMotion: Structured Physical Reasoning as Keyframe and Trajectory Guidance for Training-Free Video Generation

Add code
Jun 12, 2026
Viaarxiv icon

TIDE: Task-Isolated Diffusion for Unified Video Editing and Generation

Add code
Jun 06, 2026
Viaarxiv icon

PARE: Pruning and Adaptive Routing for Efficient Video Generation

Add code
May 26, 2026
Viaarxiv icon

ShotDirector: Directorially Controllable Multi-Shot Video Generation with Cinematographic Transitions

Add code
Dec 11, 2025
Figure 1 for ShotDirector: Directorially Controllable Multi-Shot Video Generation with Cinematographic Transitions
Figure 2 for ShotDirector: Directorially Controllable Multi-Shot Video Generation with Cinematographic Transitions
Figure 3 for ShotDirector: Directorially Controllable Multi-Shot Video Generation with Cinematographic Transitions
Figure 4 for ShotDirector: Directorially Controllable Multi-Shot Video Generation with Cinematographic Transitions
Viaarxiv icon

THEval. Evaluation Framework for Talking Head Video Generation

Add code
Nov 06, 2025
Viaarxiv icon

LIA-X: Interpretable Latent Portrait Animator

Add code
Aug 13, 2025
Viaarxiv icon

Consistent and Controllable Image Animation with Motion Linear Diffusion Transformers

Add code
Aug 10, 2025
Figure 1 for Consistent and Controllable Image Animation with Motion Linear Diffusion Transformers
Figure 2 for Consistent and Controllable Image Animation with Motion Linear Diffusion Transformers
Figure 3 for Consistent and Controllable Image Animation with Motion Linear Diffusion Transformers
Figure 4 for Consistent and Controllable Image Animation with Motion Linear Diffusion Transformers
Viaarxiv icon

GenHOI: Generalizing Text-driven 4D Human-Object Interaction Synthesis for Unseen Objects

Add code
Jun 18, 2025
Figure 1 for GenHOI: Generalizing Text-driven 4D Human-Object Interaction Synthesis for Unseen Objects
Figure 2 for GenHOI: Generalizing Text-driven 4D Human-Object Interaction Synthesis for Unseen Objects
Figure 3 for GenHOI: Generalizing Text-driven 4D Human-Object Interaction Synthesis for Unseen Objects
Figure 4 for GenHOI: Generalizing Text-driven 4D Human-Object Interaction Synthesis for Unseen Objects
Viaarxiv icon

Research on Aerodynamic Performance Prediction of Airfoils Based on a Fusion Algorithm of Transformer and GAN

Add code
Jun 08, 2025
Viaarxiv icon

Training-free Stylized Text-to-Image Generation with Fast Inference

Add code
May 25, 2025
Viaarxiv icon