Xin Tao

Easier Painting Than Thinking: Can Text-to-Image Models Set the Stage, but Not Direct the Play?

Sep 03, 2025
Via arXiv

Score Augmentation for Diffusion Models

Aug 11, 2025
Via arXiv

Imbalance in Balance: Online Concept Balancing in Generation Models

Jul 17, 2025
Via arXiv

Training-Free Efficient Video Generation via Dynamic Token Carving

May 22, 2025
Via arXiv

VFRTok: Variable Frame Rates Video Tokenizer with Duration-Proportional Information Assumption

May 17, 2025
Via arXiv

BadVideo: Stealthy Backdoor Attack against Text-to-Video Generation

Apr 23, 2025
Via arXiv

Boosting Resolution Generalization of Diffusion Transformers with Randomized Positional Encodings

Mar 24, 2025
Via arXiv

DiffMoE: Dynamic Token Selection for Scalable Diffusion Transformers

Mar 18, 2025
Via arXiv

MTV-Inpaint: Multi-Task Long Video Inpainting

Mar 14, 2025
Via arXiv

RectifiedHR: Enable Efficient High-Resolution Image Generation via Energy Rectification

Mar 04, 2025
Via arXiv