Picture for Yuanzhi Wang

Yuanzhi Wang

FaithfulFaces: Pose-Faithful Facial Identity Preservation for Text-to-Video Generation

Add code
May 06, 2026
Viaarxiv icon

Mixture Prototype Flow Matching for Open-Set Supervised Anomaly Detection

Add code
May 04, 2026
Viaarxiv icon

Anomaly-Preference Image Generation

Add code
May 04, 2026
Viaarxiv icon

Decoupled Hierarchical Distillation for Multimodal Emotion Recognition

Add code
Feb 04, 2026
Viaarxiv icon

Multi-Modal Hypergraph Enhanced LLM Learning for Recommendation

Add code
Apr 13, 2025
Figure 1 for Multi-Modal Hypergraph Enhanced LLM Learning for Recommendation
Figure 2 for Multi-Modal Hypergraph Enhanced LLM Learning for Recommendation
Figure 3 for Multi-Modal Hypergraph Enhanced LLM Learning for Recommendation
Figure 4 for Multi-Modal Hypergraph Enhanced LLM Learning for Recommendation
Viaarxiv icon

Distribution Prototype Diffusion Learning for Open-set Supervised Anomaly Detection

Add code
Feb 28, 2025
Figure 1 for Distribution Prototype Diffusion Learning for Open-set Supervised Anomaly Detection
Figure 2 for Distribution Prototype Diffusion Learning for Open-set Supervised Anomaly Detection
Figure 3 for Distribution Prototype Diffusion Learning for Open-set Supervised Anomaly Detection
Figure 4 for Distribution Prototype Diffusion Learning for Open-set Supervised Anomaly Detection
Viaarxiv icon

Re-Attentional Controllable Video Diffusion Editing

Add code
Dec 16, 2024
Viaarxiv icon

MMM-RS: A Multi-modal, Multi-GSD, Multi-scene Remote Sensing Dataset and Benchmark for Text-to-Image Generation

Add code
Oct 26, 2024
Figure 1 for MMM-RS: A Multi-modal, Multi-GSD, Multi-scene Remote Sensing Dataset and Benchmark for Text-to-Image Generation
Figure 2 for MMM-RS: A Multi-modal, Multi-GSD, Multi-scene Remote Sensing Dataset and Benchmark for Text-to-Image Generation
Figure 3 for MMM-RS: A Multi-modal, Multi-GSD, Multi-scene Remote Sensing Dataset and Benchmark for Text-to-Image Generation
Figure 4 for MMM-RS: A Multi-modal, Multi-GSD, Multi-scene Remote Sensing Dataset and Benchmark for Text-to-Image Generation
Viaarxiv icon

Edit Temporal-Consistent Videos with Image Diffusion Model

Add code
Aug 17, 2023
Figure 1 for Edit Temporal-Consistent Videos with Image Diffusion Model
Figure 2 for Edit Temporal-Consistent Videos with Image Diffusion Model
Figure 3 for Edit Temporal-Consistent Videos with Image Diffusion Model
Figure 4 for Edit Temporal-Consistent Videos with Image Diffusion Model
Viaarxiv icon

Decoupled Multimodal Distilling for Emotion Recognition

Add code
Mar 24, 2023
Figure 1 for Decoupled Multimodal Distilling for Emotion Recognition
Figure 2 for Decoupled Multimodal Distilling for Emotion Recognition
Figure 3 for Decoupled Multimodal Distilling for Emotion Recognition
Figure 4 for Decoupled Multimodal Distilling for Emotion Recognition
Viaarxiv icon