Picture for Mike Zheng Shou

Mike Zheng Shou

Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model

Add code
Apr 08, 2025
Viaarxiv icon

AssistPDA: An Online Video Surveillance Assistant for Video Anomaly Prediction, Detection, and Analysis

Add code
Mar 27, 2025
Viaarxiv icon

Long-Context Autoregressive Video Modeling with Next-Frame Prediction

Add code
Mar 25, 2025
Viaarxiv icon

Impossible Videos

Add code
Mar 18, 2025
Viaarxiv icon

Edit Transfer: Learning Image Editing via Vision In-Context Relations

Add code
Mar 17, 2025
Viaarxiv icon

VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning

Add code
Mar 17, 2025
Viaarxiv icon

VLog: Video-Language Models by Generative Retrieval of Narration Vocabulary

Add code
Mar 12, 2025
Viaarxiv icon

TPDiff: Temporal Pyramid Video Diffusion Model

Add code
Mar 12, 2025
Viaarxiv icon

In-Context Defense in Computer Agents: An Empirical Study

Add code
Mar 12, 2025
Viaarxiv icon

Balanced Image Stylization with Style Matching Score

Add code
Mar 10, 2025
Viaarxiv icon