Yinghui Huang

Large-scale Language Model Rescoring on Long-form Data

Jun 13, 2023
Via arXiv

Modular Hybrid Autoregressive Transducer

Oct 31, 2022
Via arXiv

Non-Parallel Voice Conversion for ASR Augmentation

Sep 15, 2022
Via arXiv

Analysis of Self-Attention Head Diversity for Conformer-based Automatic Speech Recognition

Sep 13, 2022
Via arXiv

Leveraging Unpaired Text Data for Training End-to-End Speech-to-Intent Systems

Oct 08, 2020
Via arXiv

End-to-End Spoken Language Understanding Without Full Transcripts

Sep 30, 2020
Via arXiv

English Broadcast News Speech Recognition by Humans and Machines

Apr 30, 2019
Via arXiv

Looking for ELMo's friends: Sentence-Level Pretraining Beyond Language Modeling

Dec 28, 2018
Via arXiv