Yan Lu

Efficient Autoregressive Video Diffusion with Dummy Head

Jan 28, 2026
Via arXiv

Hierarchical Long Video Understanding with Audiovisual Entity Cohesion and Agentic Search

Jan 20, 2026
Via arXiv

Breaking Coordinate Overfitting: Geometry-Aware WiFi Sensing for Cross-Layout 3D Pose Estimation

Jan 18, 2026
Via arXiv

A Unified Neural Codec Language Model for Selective Editable Text to Speech Generation

Jan 18, 2026
Via arXiv

From Off-Policy to On-Policy: Enhancing GUI Agents via Bi-level Expert-to-Policy Assimilation

Jan 09, 2026
Via arXiv

InfiniteWeb: Scalable Web Environment Synthesis for GUI Agent Training

Jan 08, 2026
Via arXiv

Animate Any Character in Any World

Dec 18, 2025
Via arXiv

Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows

Dec 18, 2025
Via arXiv

Spatia: Video Generation with Updatable Spatial Memory

Dec 17, 2025
Via arXiv

Single-step Diffusion-based Video Coding with Semantic-Temporal Guidance

Dec 08, 2025
Via arXiv