Picture for Xiao Yang

Xiao Yang

minWM: A Full-Stack Open-Source Framework for Real-Time Interactive Video World Models

Add code
May 28, 2026
Viaarxiv icon

MuChator: Enabling Active Music Discovery via Conversational Music LLMs in Douyin Music

Add code
May 26, 2026
Viaarxiv icon

Co-ReAct: Rubrics as Step-Level Collaborators for ReAct Agents

Add code
May 22, 2026
Viaarxiv icon

OnePred: Next-Query Prediction via Recursive Intent Memory in Multi-Turn Conversations

Add code
May 22, 2026
Viaarxiv icon

Causal Forcing++: Scalable Few-Step Autoregressive Diffusion Distillation for Real-Time Interactive Video Generation

Add code
May 14, 2026
Viaarxiv icon

GeoVista: Visually Grounded Active Perception for Ultra-High-Resolution Remote Sensing Understanding

Add code
May 14, 2026
Viaarxiv icon

Anatomy-Slot: Unsupervised Anatomical Factorization for Homologous Bilateral Reasoning in Retinal Diagnosis

Add code
May 13, 2026
Viaarxiv icon

AdaFocus: Adaptive Relevance-Diversity Sampling with Zero-Cache Look-back for Efficient Long Video Understanding

Add code
May 13, 2026
Viaarxiv icon

U-HNO: A U-shaped Hybrid Neural Operator with Sparse-Point Adaptive Routing for Non-stationary PDE Dynamics

Add code
May 13, 2026
Viaarxiv icon

Toward Polymorphic Backdoor against Semantic Communication via Intensity-Based Poisoning

Add code
Apr 25, 2026
Viaarxiv icon