Picture for Siyuan Wang

Siyuan Wang

Fudan University

Bridging Vision and Language Concepts through Optimal Transport Semantic Flow

Add code
Jun 25, 2026
Viaarxiv icon

SCAN: Enhance Time Series Anomaly Detection via Multi-Scale Neighborhood-Centered Clustering

Add code
Jun 17, 2026
Viaarxiv icon

Can Agents Read the Room? Benchmarking Visual Social Intelligence in Multimodal Simulation

Add code
Jun 13, 2026
Viaarxiv icon

OneRetrieval: Unifying Multi-Branch E-commerce Retrieval with an Editable Generative Model

Add code
Jun 11, 2026
Viaarxiv icon

LIBERO-Occ: Evaluating and Improving Vision-Language-Action Models under Scene-Induced Occlusion via Viewpoint Imagination

Add code
Jun 09, 2026
Viaarxiv icon

LiAuto-GeoX: Efficient Grounded Driving Transformer

Add code
Jun 04, 2026
Viaarxiv icon

HyLaT: Efficient Multi-Agent Communication via Hybrid Latent-Text Protocol

Add code
May 25, 2026
Viaarxiv icon

SpatialAnt: Autonomous Zero-Shot Robot Navigation via Active Scene Reconstruction and Visual Anticipation

Add code
Mar 27, 2026
Viaarxiv icon

OneSearch-V2: The Latent Reasoning Enhanced Self-distillation Generative Search Framework

Add code
Mar 25, 2026
Viaarxiv icon

Unlocking the Value of Text: Event-Driven Reasoning and Multi-Level Alignment for Time Series Forecasting

Add code
Mar 16, 2026
Viaarxiv icon