Kai Zhu

EditEmoTalk: Controllable Speech-Driven 3D Facial Animation with Continuous Expression Editing

Jan 15, 2026
Via arXiv

Anchoring Values in Temporal and Group Dimensions for Flow Matching Model Alignment

Dec 13, 2025
Via arXiv

AliTok: Towards Sequence Modeling Alignment between Tokenizer and Autoregressive Model

Jun 05, 2025
Via arXiv

Self-Supervised Evolution Operator Learning for High-Dimensional Dynamical Systems

May 24, 2025
Via arXiv

Semantically-Aware Game Image Quality Assessment

May 16, 2025
Via arXiv

UCS: A Universal Model for Curvilinear Structure Segmentation

Apr 05, 2025
Via arXiv

Exploring Reliable PPG Authentication on Smartwatches in Daily Scenarios

Mar 31, 2025
Via arXiv

Wan: Open and Advanced Large-Scale Video Generative Models

Mar 26, 2025
Via arXiv

VanGogh: A Unified Multimodal Diffusion-based Framework for Video Colorization

Jan 16, 2025
Via arXiv

MangaNinja: Line Art Colorization with Precise Reference Following

Jan 14, 2025
Via arXiv