Xi Yin

MotiF: Making Text Count in Image Animation with Motion Focal Loss

Dec 20, 2024
Via arXiv

Flowing from Words to Pixels: A Framework for Cross-Modality Evolution

Dec 19, 2024
Via arXiv

Movie Gen: A Cast of Media Foundation Models

Oct 17, 2024
Via arXiv

Proactive Schemes: A Survey of Adversarial Attacks for Social Good

Sep 24, 2024
Via arXiv

AcademicGPT: Empowering Academic Research

Nov 21, 2023
Via arXiv

Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning

Nov 17, 2023
Via arXiv

Towards Effective and Compact Contextual Representation for Conformer Transducer Speech Recognition Systems

Jun 26, 2023
Via arXiv

Latent-Shift: Latent Diffusion with Temporal Shift for Efficient Text-to-Video Generation

Apr 18, 2023
Via arXiv

MaLP: Manipulation Localization Using a Proactive Scheme

Apr 04, 2023
Via arXiv

SpaText: Spatio-Textual Representation for Controllable Image Generation

Nov 25, 2022
Via arXiv