Limin Wang

Explainable Forensics of Manipulated Segments in Untrimmed Long Videos

Jun 01, 2026

StreamOV: Streaming Omni-Video Understanding via Evidence-Guided Memory and Response Triggering

May 25, 2026

USV: Towards Understanding the User-generated Short-form Videos

May 20, 2026

Towards Multimodal Lifelong Understanding: A Dataset and Agentic Baseline

Mar 05, 2026

RIVER: A Real-Time Interaction Benchmark for Video LLMs

Mar 04, 2026

LongVPO: From Anchored Cues to Self-Reasoning for Long-Form Video Preference Optimization

Feb 02, 2026

GLAD: Generative Language-Assisted Visual Tracking for Low-Semantic Templates

Jan 31, 2026

Video-o3: Native Interleaved Clue Seeking for Long Video Multi-Hop Reasoning

Jan 30, 2026

VMonarch: Efficient Video Diffusion Transformers with Structured Attention

Jan 29, 2026

Towards Pixel-Level VLM Perception via Simple Points Prediction

Jan 27, 2026