Picture for Yuhang Zhang

Yuhang Zhang

Evaluating Human Perception of Novel View Synthesis: Subjective Quality Assessment of Gaussian Splatting and NeRF in Dynamic Scenes

Add code
Jan 13, 2025
Viaarxiv icon

Face-Human-Bench: A Comprehensive Benchmark of Face and Human Understanding for Multi-modal Assistants

Add code
Jan 05, 2025
Figure 1 for Face-Human-Bench: A Comprehensive Benchmark of Face and Human Understanding for Multi-modal Assistants
Figure 2 for Face-Human-Bench: A Comprehensive Benchmark of Face and Human Understanding for Multi-modal Assistants
Figure 3 for Face-Human-Bench: A Comprehensive Benchmark of Face and Human Understanding for Multi-modal Assistants
Figure 4 for Face-Human-Bench: A Comprehensive Benchmark of Face and Human Understanding for Multi-modal Assistants
Viaarxiv icon

Fleximo: Towards Flexible Text-to-Human Motion Video Generation

Add code
Nov 29, 2024
Viaarxiv icon

Act in Collusion: A Persistent Distributed Multi-Target Backdoor in Federated Learning

Add code
Nov 06, 2024
Viaarxiv icon

Curriculum Prompting Foundation Models for Medical Image Segmentation

Add code
Sep 01, 2024
Figure 1 for Curriculum Prompting Foundation Models for Medical Image Segmentation
Figure 2 for Curriculum Prompting Foundation Models for Medical Image Segmentation
Figure 3 for Curriculum Prompting Foundation Models for Medical Image Segmentation
Figure 4 for Curriculum Prompting Foundation Models for Medical Image Segmentation
Viaarxiv icon

Generalizable Facial Expression Recognition

Add code
Aug 20, 2024
Viaarxiv icon

DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion

Add code
Jul 17, 2024
Figure 1 for DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion
Figure 2 for DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion
Figure 3 for DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion
Figure 4 for DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion
Viaarxiv icon

FT-AED: Benchmark Dataset for Early Freeway Traffic Anomalous Event Detection

Add code
Jun 24, 2024
Viaarxiv icon

Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers

Add code
May 09, 2024
Figure 1 for Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers
Figure 2 for Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers
Figure 3 for Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers
Figure 4 for Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers
Viaarxiv icon

Beyond Traditional Threats: A Persistent Backdoor Attack on Federated Learning

Add code
Apr 26, 2024
Viaarxiv icon