Picture for Eric Sun

Eric Sun

Target word activity detector: An approach to obtain ASR word boundaries without lexicon

Add code
Sep 20, 2024
Viaarxiv icon

A Survey on Large Language Model Security and Privacy: The Good, the Bad, and the Ugly

Add code
Dec 04, 2023
Viaarxiv icon

Pre-training End-to-end ASR Models with Augmented Speech Samples Queried by Text

Add code
Jul 30, 2023
Figure 1 for Pre-training End-to-end ASR Models with Augmented Speech Samples Queried by Text
Figure 2 for Pre-training End-to-end ASR Models with Augmented Speech Samples Queried by Text
Figure 3 for Pre-training End-to-end ASR Models with Augmented Speech Samples Queried by Text
Figure 4 for Pre-training End-to-end ASR Models with Augmented Speech Samples Queried by Text
Viaarxiv icon

Building High-accuracy Multilingual ASR with Gated Language Experts and Curriculum Training

Add code
Mar 01, 2023
Viaarxiv icon

LAMASSU: Streaming Language-Agnostic Multilingual Speech Recognition and Translation Using Neural Transducers

Add code
Nov 05, 2022
Figure 1 for LAMASSU: Streaming Language-Agnostic Multilingual Speech Recognition and Translation Using Neural Transducers
Figure 2 for LAMASSU: Streaming Language-Agnostic Multilingual Speech Recognition and Translation Using Neural Transducers
Figure 3 for LAMASSU: Streaming Language-Agnostic Multilingual Speech Recognition and Translation Using Neural Transducers
Figure 4 for LAMASSU: Streaming Language-Agnostic Multilingual Speech Recognition and Translation Using Neural Transducers
Viaarxiv icon

A Weakly-Supervised Streaming Multilingual Speech Model with Truly Zero-Shot Capability

Add code
Nov 04, 2022
Viaarxiv icon

Building a great multi-lingual teacher with sparsely-gated mixture of experts for speech recognition

Add code
Jan 04, 2022
Figure 1 for Building a great multi-lingual teacher with sparsely-gated mixture of experts for speech recognition
Figure 2 for Building a great multi-lingual teacher with sparsely-gated mixture of experts for speech recognition
Figure 3 for Building a great multi-lingual teacher with sparsely-gated mixture of experts for speech recognition
Figure 4 for Building a great multi-lingual teacher with sparsely-gated mixture of experts for speech recognition
Viaarxiv icon

Multilingual Speech Recognition using Knowledge Transfer across Learning Processes

Add code
Oct 15, 2021
Figure 1 for Multilingual Speech Recognition using Knowledge Transfer across Learning Processes
Figure 2 for Multilingual Speech Recognition using Knowledge Transfer across Learning Processes
Figure 3 for Multilingual Speech Recognition using Knowledge Transfer across Learning Processes
Figure 4 for Multilingual Speech Recognition using Knowledge Transfer across Learning Processes
Viaarxiv icon

A Configurable Multilingual Model is All You Need to Recognize All Languages

Add code
Jul 13, 2021
Figure 1 for A Configurable Multilingual Model is All You Need to Recognize All Languages
Figure 2 for A Configurable Multilingual Model is All You Need to Recognize All Languages
Figure 3 for A Configurable Multilingual Model is All You Need to Recognize All Languages
Figure 4 for A Configurable Multilingual Model is All You Need to Recognize All Languages
Viaarxiv icon

Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition

Add code
Jun 04, 2021
Figure 1 for Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition
Figure 2 for Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition
Figure 3 for Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition
Viaarxiv icon