Hao Yang

Chain-of-Description: What I can understand, I can put into words

Feb 22, 2025
Via arXiv

Enhancing Speech Large Language Models with Prompt-Aware Mixture of Audio Encoders

Feb 21, 2025
Via arXiv

Goku: Flow Based Video Generative Foundation Models

Feb 10, 2025
Via arXiv

A Survey on Video Analytics in Cloud-Edge-Terminal Collaborative Systems

Feb 10, 2025
Via arXiv

Gravity Compensation of the dVRK-Si Patient Side Manipulator based on Dynamic Model Identification

Jan 31, 2025
Via arXiv

StereoGen: High-quality Stereo Image Generation from a Single Image

Jan 15, 2025
Via arXiv

Doc-Guided Sent2Sent++: A Sent2Sent++ Agent with Doc-Guided memory for Document-level Machine Translation

Jan 15, 2025
Via arXiv

Optimizing Speech Multi-View Feature Fusion through Conditional Computation

Jan 14, 2025
Via arXiv

Investigating Numerical Translation with Large Language Models

Jan 09, 2025
Via arXiv

"I've Heard of You!": Generate Spoken Named Entity Recognition Data for Unseen Entities

Dec 26, 2024
Via arXiv