Zikai Zhou

Qwen-RobotWorld Technical Report: Unifying Embodied World Modeling through Language-Conditioned Video Generation

Jun 17, 2026
Via arXiv

Qwen-Image-Flash: Beyond Objective Design

Jun 03, 2026
Via arXiv

Qwen-Image-Bench: From Generation to Creation in Text-to-Image Evaluation

May 27, 2026
Via arXiv

Qwen-Image-VAE-2.0 Technical Report

May 13, 2026
Via arXiv

Qwen-Image-2.0 Technical Report

May 11, 2026
Via arXiv

Lightning Unified Video Editing via In-Context Sparse Attention

May 06, 2026
Via arXiv

Exploring Data-Free LoRA Transferability for Video Diffusion Models

May 03, 2026
Via arXiv

Guidance Matters: Rethinking the Evaluation Pitfall for Text-to-Image Generation

Feb 26, 2026
Via arXiv

Optimizing Few-Step Generation with Adaptive Matching Distillation

Feb 07, 2026
Via arXiv

PISA: Piecewise Sparse Attention Is Wiser for Efficient Diffusion Transformers

Feb 03, 2026
Via arXiv