Heung-Yeung Shum

Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction

Feb 18, 2025

Taming Teacher Forcing for Masked Autoregressive Video Generation

Jan 21, 2025

Multi-matrix Factorization Attention

Dec 26, 2024

Large Investment Model

Aug 22, 2024

Follow-Your-Pose v2: Multiple-Condition Guided Character Image Animation for Stable Pose Control

Jun 05, 2024

Follow-Your-Emoji: Fine-Controllable and Expressive Freestyle Portrait Animation

Jun 04, 2024

Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts

Mar 13, 2024

HumanTOMATO: Text-aligned Whole-body Motion Generation

Oct 19, 2023

TOSS: High-quality Text-guided Novel View Synthesis from a Single Image

Oct 16, 2023

Reinforced Disentanglement for Face Swapping without Skip Connection

Aug 03, 2023