Picture for Yuhang Zhang

Yuhang Zhang

Fleximo: Towards Flexible Text-to-Human Motion Video Generation

Add code
Nov 29, 2024
Viaarxiv icon

Act in Collusion: A Persistent Distributed Multi-Target Backdoor in Federated Learning

Add code
Nov 06, 2024
Viaarxiv icon

Curriculum Prompting Foundation Models for Medical Image Segmentation

Add code
Sep 01, 2024
Figure 1 for Curriculum Prompting Foundation Models for Medical Image Segmentation
Figure 2 for Curriculum Prompting Foundation Models for Medical Image Segmentation
Figure 3 for Curriculum Prompting Foundation Models for Medical Image Segmentation
Figure 4 for Curriculum Prompting Foundation Models for Medical Image Segmentation
Viaarxiv icon

Generalizable Facial Expression Recognition

Add code
Aug 20, 2024
Viaarxiv icon

DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion

Add code
Jul 17, 2024
Figure 1 for DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion
Figure 2 for DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion
Figure 3 for DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion
Figure 4 for DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion
Viaarxiv icon

FT-AED: Benchmark Dataset for Early Freeway Traffic Anomalous Event Detection

Add code
Jun 24, 2024
Viaarxiv icon

Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers

Add code
May 09, 2024
Figure 1 for Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers
Figure 2 for Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers
Figure 3 for Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers
Figure 4 for Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers
Viaarxiv icon

Beyond Traditional Threats: A Persistent Backdoor Attack on Federated Learning

Add code
Apr 26, 2024
Viaarxiv icon

Faceptor: A Generalist Model for Face Perception

Add code
Mar 14, 2024
Viaarxiv icon

A Middle Way to Traffic Enlightenment

Add code
Jan 29, 2024
Viaarxiv icon