Xin Tao

Terra: Explorable Native 3D World Model with Point Latents

Oct 16, 2025

Mitigating the Noise Shift for Denoising Generative Models via Noise Awareness Guidance

Oct 14, 2025

Easier Painting Than Thinking: Can Text-to-Image Models Set the Stage, but Not Direct the Play?

Sep 03, 2025

Score Augmentation for Diffusion Models

Aug 11, 2025

Imbalance in Balance: Online Concept Balancing in Generation Models

Jul 17, 2025

Training-Free Efficient Video Generation via Dynamic Token Carving

May 22, 2025

VFRTok: Variable Frame Rates Video Tokenizer with Duration-Proportional Information Assumption

May 17, 2025

BadVideo: Stealthy Backdoor Attack against Text-to-Video Generation

Apr 23, 2025

Boosting Resolution Generalization of Diffusion Transformers with Randomized Positional Encodings

Mar 24, 2025

DiffMoE: Dynamic Token Selection for Scalable Diffusion Transformers

Mar 18, 2025