Hao Huang

HALO: A Unified Vision-Language-Action Model for Embodied Multimodal Chain-of-Thought Reasoning

Feb 24, 2026

UrbanAlign: Post-hoc Semantic Calibration for VLM-Human Preference Alignment

Feb 23, 2026

CTC-TTS: LLM-based dual-streaming text-to-speech with CTC alignment

Feb 23, 2026

ProphetKV: User-Query-Driven Selective Recomputation for Efficient KV Cache Reuse in Retrieval-Augmented Generation

Jan 31, 2026

MVGD-Net: A Novel Motion-aware Video Glass Surface Detection Network

Jan 20, 2026

OxygenREC: An Instruction-Following Generative Framework for E-commerce Recommendation

Dec 31, 2025

Phoneme-based speech recognition driven by large language models and sampling marginalization

Dec 20, 2025

Multi-Intent Spoken Language Understanding: Methods, Trends, and Challenges

Dec 12, 2025

SGMAGNet: A Baseline Model for 3D Cloud Phase Structure Reconstruction on a New Passive Active Satellite Benchmark

Sep 19, 2025

SHREC 2025: Protein surface shape retrieval including electrostatic potential

Sep 16, 2025