Yuang Li

Optimizing Speech Multi-View Feature Fusion through Conditional Computation

Jan 14, 2025
Via arXiv

Investigating Numerical Translation with Large Language Models

Jan 09, 2025
Via arXiv

"I've Heard of You!": Generate Spoken Named Entity Recognition Data for Unseen Entities

Dec 26, 2024
Via arXiv

Hard-Synth: Synthesizing Diverse Hard Samples for ASR using Zero-Shot TTS and LLM

Nov 20, 2024
Via arXiv

Why Not Transform Chat Large Language Models to Non-English?

May 22, 2024
Via arXiv

Cross-Domain Audio Deepfake Detection: Dataset and Analysis

Apr 07, 2024
Via arXiv

Using Large Language Model for End-to-End Chinese ASR and NER

Jan 21, 2024
Via arXiv

CB-Whisper: Contextual Biasing Whisper using TTS-based Keyword Spotting

Sep 18, 2023
Via arXiv

Accelerating Transducers through Adjacent Token Merging

Jun 28, 2023
Via arXiv

Prompting Large Language Models for Zero-Shot Domain Adaptation in Speech Recognition

Jun 28, 2023
Via arXiv