Picture for Shiyi Zhang

Shiyi Zhang

TAG-MoE: Task-Aware Gating for Unified Generative Mixture-of-Experts

Add code
Jan 12, 2026
Viaarxiv icon

Re-Align: Structured Reasoning-guided Alignment for In-Context Image Generation and Editing

Add code
Jan 08, 2026
Viaarxiv icon

Memorize-and-Generate: Towards Long-Term Consistency in Real-Time Video Generation

Add code
Dec 23, 2025
Viaarxiv icon

NeuroSwift: A Lightweight Cross-Subject Framework for fMRI Visual Reconstruction of Complex Scenes

Add code
Oct 02, 2025
Figure 1 for NeuroSwift: A Lightweight Cross-Subject Framework for fMRI Visual Reconstruction of Complex Scenes
Figure 2 for NeuroSwift: A Lightweight Cross-Subject Framework for fMRI Visual Reconstruction of Complex Scenes
Figure 3 for NeuroSwift: A Lightweight Cross-Subject Framework for fMRI Visual Reconstruction of Complex Scenes
Figure 4 for NeuroSwift: A Lightweight Cross-Subject Framework for fMRI Visual Reconstruction of Complex Scenes
Viaarxiv icon

Directly Aligning the Full Diffusion Trajectory with Fine-Grained Human Preference

Add code
Sep 09, 2025
Viaarxiv icon

HAVIR: HierArchical Vision to Image Reconstruction using CLIP-Guided Versatile Diffusion

Add code
Jun 06, 2025
Viaarxiv icon

FlexiAct: Towards Flexible Action Control in Heterogeneous Scenarios

Add code
May 06, 2025
Viaarxiv icon

KV-Edit: Training-Free Image Editing for Precise Background Preservation

Add code
Feb 25, 2025
Figure 1 for KV-Edit: Training-Free Image Editing for Precise Background Preservation
Figure 2 for KV-Edit: Training-Free Image Editing for Precise Background Preservation
Figure 3 for KV-Edit: Training-Free Image Editing for Precise Background Preservation
Figure 4 for KV-Edit: Training-Free Image Editing for Precise Background Preservation
Viaarxiv icon

ColorFlow: Retrieval-Augmented Image Sequence Colorization

Add code
Dec 16, 2024
Viaarxiv icon

Narrative Action Evaluation with Prompt-Guided Multimodal Interaction

Add code
Apr 26, 2024
Viaarxiv icon