Ke Hu

VoiceTextBlender: Augmenting Large Language Models with Speech Capabilities via Single-Stage Joint Speech-Text Supervised Fine-Tuning

Oct 23, 2024

Chain-of-Thought Prompting for Speech Translation

Sep 17, 2024

Robust Principal Component Analysis via Discriminant Sample Weight Learning

Aug 22, 2024

Diffusion-based Reinforcement Learning via Q-weighted Variational Policy Optimization

May 25, 2024

Enhancing Visual Continual Learning with Language-Guided Supervision

Mar 24, 2024

Multilingual and Fully Non-Autoregressive ASR with Large Language Model Fusion: A Comprehensive Study

Jan 23, 2024

Feature Norm Regularized Federated Learning: Transforming Skewed Distributions into Global Insights

Dec 12, 2023

Improving Joint Speech-Text Representations Without Alignment

Aug 11, 2023

Mixture-of-Expert Conformer for Streaming Multilingual ASR

May 25, 2023

A Deliberation-based Joint Acoustic and Text Decoder

Mar 23, 2023