Picture for Yong Xu

Yong Xu

SpatialEmb: Extract and Encode Spatial Information for 1-Stage Multi-channel Multi-speaker ASR on Arbitrary Microphone Arrays

Add code
Jan 25, 2026
Viaarxiv icon

Accurate Calibration and Robust LiDAR-Inertial Odometry for Spinning Actuated LiDAR Systems

Add code
Jan 24, 2026
Viaarxiv icon

Advancing Adaptive Multi-Stage Video Anomaly Reasoning: A Benchmark Dataset and Method

Add code
Jan 15, 2026
Viaarxiv icon

WearVox: An Egocentric Multichannel Voice Assistant Benchmark for Wearables

Add code
Dec 25, 2025
Viaarxiv icon

User Hesitation and Negative Transfer in Multi-Behavior Recommendation

Add code
Nov 08, 2025
Viaarxiv icon

Multi-Channel Differential ASR for Robust Wearer Speech Recognition on Smart Glasses

Add code
Sep 17, 2025
Viaarxiv icon

Agent4FaceForgery: Multi-Agent LLM Framework for Realistic Face Forgery Detection

Add code
Sep 16, 2025
Viaarxiv icon

Region-Specific Audio Tagging for Spatial Sound

Add code
Sep 11, 2025
Viaarxiv icon

PVNet: Point-Voxel Interaction LiDAR Scene Upsampling Via Diffusion Models

Add code
Aug 23, 2025
Viaarxiv icon

AU-LLM: Micro-Expression Action Unit Detection via Enhanced LLM-Based Feature Fusion

Add code
Jul 29, 2025
Viaarxiv icon