Picture for Yuxiang Zhao

Yuxiang Zhao

OpenSTBench: Beyond Semantic Evaluation for Speech Translation

Add code
May 29, 2026
Viaarxiv icon

POINav: Benchmarking and Enhancing Final-Meters Arrival in Real-World Vision-Language Navigation

Add code
May 27, 2026
Viaarxiv icon

Chatting about Conditional Trajectory Prediction

Add code
Apr 20, 2026
Viaarxiv icon

Chatting about Upper-Body Expressive Human Pose and Shape Estimation

Add code
Apr 20, 2026
Viaarxiv icon

X-VC: Zero-shot Streaming Voice Conversion in Codec Space

Add code
Apr 14, 2026
Viaarxiv icon

SoulX-Duplug: Plug-and-Play Streaming State Prediction Module for Realtime Full-Duplex Speech Conversation

Add code
Mar 16, 2026
Viaarxiv icon

ABot-N0: Technical Report on the VLA Foundation Model for Versatile Embodied Navigation

Add code
Feb 12, 2026
Viaarxiv icon

Bridging the Indoor-Outdoor Gap: Vision-Centric Instruction-Guided Embodied Navigation for the Last Meters

Add code
Feb 06, 2026
Viaarxiv icon

UAV-Based Remote Sensing of Soil Moisture Across Diverse Land Covers: Validation and Bayesian Uncertainty Characterization

Add code
Jun 05, 2025
Viaarxiv icon

Descriptive Caption Enhancement with Visual Specialists for Multimodal Perception

Add code
Dec 18, 2024
Figure 1 for Descriptive Caption Enhancement with Visual Specialists for Multimodal Perception
Figure 2 for Descriptive Caption Enhancement with Visual Specialists for Multimodal Perception
Figure 3 for Descriptive Caption Enhancement with Visual Specialists for Multimodal Perception
Figure 4 for Descriptive Caption Enhancement with Visual Specialists for Multimodal Perception
Viaarxiv icon