Picture for Xinyuan Chen

Xinyuan Chen

ShotDirector: Directorially Controllable Multi-Shot Video Generation with Cinematographic Transitions

Add code
Dec 11, 2025
Figure 1 for ShotDirector: Directorially Controllable Multi-Shot Video Generation with Cinematographic Transitions
Figure 2 for ShotDirector: Directorially Controllable Multi-Shot Video Generation with Cinematographic Transitions
Figure 3 for ShotDirector: Directorially Controllable Multi-Shot Video Generation with Cinematographic Transitions
Figure 4 for ShotDirector: Directorially Controllable Multi-Shot Video Generation with Cinematographic Transitions
Viaarxiv icon

LIA-X: Interpretable Latent Portrait Animator

Add code
Aug 13, 2025
Viaarxiv icon

Consistent and Controllable Image Animation with Motion Linear Diffusion Transformers

Add code
Aug 10, 2025
Figure 1 for Consistent and Controllable Image Animation with Motion Linear Diffusion Transformers
Figure 2 for Consistent and Controllable Image Animation with Motion Linear Diffusion Transformers
Figure 3 for Consistent and Controllable Image Animation with Motion Linear Diffusion Transformers
Figure 4 for Consistent and Controllable Image Animation with Motion Linear Diffusion Transformers
Viaarxiv icon

Self-Improvement for Audio Large Language Model using Unlabeled Speech

Add code
Jul 27, 2025
Viaarxiv icon

GenHOI: Generalizing Text-driven 4D Human-Object Interaction Synthesis for Unseen Objects

Add code
Jun 18, 2025
Figure 1 for GenHOI: Generalizing Text-driven 4D Human-Object Interaction Synthesis for Unseen Objects
Figure 2 for GenHOI: Generalizing Text-driven 4D Human-Object Interaction Synthesis for Unseen Objects
Figure 3 for GenHOI: Generalizing Text-driven 4D Human-Object Interaction Synthesis for Unseen Objects
Figure 4 for GenHOI: Generalizing Text-driven 4D Human-Object Interaction Synthesis for Unseen Objects
Viaarxiv icon

Faster than Fast: Accelerating Oriented FAST Feature Detection on Low-end Embedded GPUs

Add code
Jun 08, 2025
Viaarxiv icon

Training-free Stylized Text-to-Image Generation with Fast Inference

Add code
May 25, 2025
Viaarxiv icon

The Devil is in the Prompts: Retrieval-Augmented Prompt Optimization for Text-to-Video Generation

Add code
Apr 16, 2025
Viaarxiv icon

AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset

Add code
Mar 25, 2025
Figure 1 for AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset
Figure 2 for AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset
Figure 3 for AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset
Figure 4 for AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset
Viaarxiv icon

GMG: A Video Prediction Method Based on Global Focus and Motion Guided

Add code
Mar 14, 2025
Figure 1 for GMG: A Video Prediction Method Based on Global Focus and Motion Guided
Figure 2 for GMG: A Video Prediction Method Based on Global Focus and Motion Guided
Figure 3 for GMG: A Video Prediction Method Based on Global Focus and Motion Guided
Figure 4 for GMG: A Video Prediction Method Based on Global Focus and Motion Guided
Viaarxiv icon