Picture for Xiao-Ping Zhang

Xiao-Ping Zhang

TetherCache: Stabilizing Autoregressive Long-Form Video Generation with Gated Recall and Trusted Alignment

Add code
Jun 11, 2026
Viaarxiv icon

WorldFly: A World-Model-Based Vision-Language-Action Model for UAV Navigation

Add code
Jun 04, 2026
Viaarxiv icon

PHGNet: Prototype-Guided Hypergraph Construction for Heterogeneous Spatiotemporal Forecasting

Add code
May 25, 2026
Viaarxiv icon

VEN-VL: A Visual Ensemble MoE Framework for Effective and Efficient Multi-Modal Understanding

Add code
May 25, 2026
Viaarxiv icon

Causal Tongue-Tie: LLMs Can Encode Causal Direction, But Their Yes/No Outputs Fail to Express

Add code
May 25, 2026
Viaarxiv icon

ADMFormer: An Adaptive-Decomposition Transformer with Time-Varying Masked Spatial Attention for Traffic Forecasting

Add code
May 25, 2026
Viaarxiv icon

Verifiable Process Rewards for Agentic Reasoning

Add code
May 11, 2026
Viaarxiv icon

OA-WAM: Object-Addressable World Action Model for Robust Robot Manipulation

Add code
May 07, 2026
Viaarxiv icon

GRPO-VPS: Enhancing Group Relative Policy Optimization with Verifiable Process Supervision for Effective Reasoning

Add code
Apr 22, 2026
Viaarxiv icon

Iterative Identification Closure: Amplifying Causal Identifiability in Linear SEMs

Add code
Apr 10, 2026
Viaarxiv icon