Picture for Shiyi Zhang

Shiyi Zhang

Ultra Flash: Scaling Real-Time Streaming Video Generation to High Resolutions

Add code
Jun 08, 2026
Viaarxiv icon

Echo-Memory: A Controlled Study of Memory in Action World Models

Add code
Jun 08, 2026
Viaarxiv icon

Echo-Infinity: Learning Evolving Memory for Real-Time Infinite Video Generation

Add code
Jun 03, 2026
Viaarxiv icon

Meta-CoT: Enhancing Granularity and Generalization in Image Editing

Add code
Apr 27, 2026
Viaarxiv icon

Generative Visual Chain-of-Thought for Image Editing

Add code
Mar 02, 2026
Viaarxiv icon

ChatUMM: Robust Context Tracking for Conversational Interleaved Generation

Add code
Feb 06, 2026
Viaarxiv icon

Alleviating Sparse Rewards by Modeling Step-Wise and Long-Term Sampling Effects in Flow-Based GRPO

Add code
Feb 06, 2026
Viaarxiv icon

TAG-MoE: Task-Aware Gating for Unified Generative Mixture-of-Experts

Add code
Jan 12, 2026
Viaarxiv icon

Re-Align: Structured Reasoning-guided Alignment for In-Context Image Generation and Editing

Add code
Jan 08, 2026
Viaarxiv icon

Memorize-and-Generate: Towards Long-Term Consistency in Real-Time Video Generation

Add code
Dec 23, 2025
Viaarxiv icon