Picture for Xiao Yang

Xiao Yang

GeoVista: Visually Grounded Active Perception for Ultra-High-Resolution Remote Sensing Understanding

Add code
May 14, 2026
Viaarxiv icon

Causal Forcing++: Scalable Few-Step Autoregressive Diffusion Distillation for Real-Time Interactive Video Generation

Add code
May 14, 2026
Viaarxiv icon

U-HNO: A U-shaped Hybrid Neural Operator with Sparse-Point Adaptive Routing for Non-stationary PDE Dynamics

Add code
May 13, 2026
Viaarxiv icon

AdaFocus: Adaptive Relevance-Diversity Sampling with Zero-Cache Look-back for Efficient Long Video Understanding

Add code
May 13, 2026
Viaarxiv icon

Anatomy-Slot: Unsupervised Anatomical Factorization for Homologous Bilateral Reasoning in Retinal Diagnosis

Add code
May 13, 2026
Viaarxiv icon

Toward Polymorphic Backdoor against Semantic Communication via Intensity-Based Poisoning

Add code
Apr 25, 2026
Viaarxiv icon

CDPR: Cross-modal Diffusion with Polarization for Reliable Monocular Depth Estimation

Add code
Apr 13, 2026
Viaarxiv icon

Agent^2 RL-Bench: Can LLM Agents Engineer Agentic RL Post-Training?

Add code
Apr 12, 2026
Viaarxiv icon

LatentUM: Unleashing the Potential of Interleaved Cross-Modal Reasoning via a Latent-Space Unified Model

Add code
Apr 02, 2026
Viaarxiv icon

Mind over Space: Can Multimodal Large Language Models Mentally Navigate?

Add code
Mar 23, 2026
Viaarxiv icon