Xiaoshi Wu

SemanticGen: Video Generation in Semantic Space

Dec 24, 2025
Via arXiv

Kling-Omni Technical Report

Dec 18, 2025
Via arXiv

SVG-T2I: Scaling Up Text-to-Image Latent Diffusion Model Without Variational Autoencoder

Dec 12, 2025
Via arXiv

Deep Reward Supervisions for Tuning Text-to-Image Diffusion Models

May 01, 2024
Via arXiv

CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching

Apr 04, 2024
Via arXiv

ECNet: Effective Controllable Text-to-Image Diffusion Models

Mar 27, 2024
Via arXiv

Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation

Mar 20, 2024
Via arXiv

JourneyDB: A Benchmark for Generative Image Understanding

Jul 03, 2023
Via arXiv

Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis

Jun 15, 2023
Via arXiv

Better Aligning Text-to-Image Models with Human Preference

Mar 25, 2023
Via arXiv