Picture for Hao Yen

Hao Yen

Efficient Long-Form Speech Recognition for General Speech In-Context Learning

Add code
Sep 29, 2024
Figure 1 for Efficient Long-Form Speech Recognition for General Speech In-Context Learning
Figure 2 for Efficient Long-Form Speech Recognition for General Speech In-Context Learning
Figure 3 for Efficient Long-Form Speech Recognition for General Speech In-Context Learning
Figure 4 for Efficient Long-Form Speech Recognition for General Speech In-Context Learning
Viaarxiv icon

An Explicit Consistency-Preserving Loss Function for Phase Reconstruction and Speech Enhancement

Add code
Sep 24, 2024
Figure 1 for An Explicit Consistency-Preserving Loss Function for Phase Reconstruction and Speech Enhancement
Figure 2 for An Explicit Consistency-Preserving Loss Function for Phase Reconstruction and Speech Enhancement
Figure 3 for An Explicit Consistency-Preserving Loss Function for Phase Reconstruction and Speech Enhancement
Figure 4 for An Explicit Consistency-Preserving Loss Function for Phase Reconstruction and Speech Enhancement
Viaarxiv icon

Language-Universal Speech Attributes Modeling for Zero-Shot Multilingual Spoken Keyword Recognition

Add code
Jun 04, 2024
Figure 1 for Language-Universal Speech Attributes Modeling for Zero-Shot Multilingual Spoken Keyword Recognition
Figure 2 for Language-Universal Speech Attributes Modeling for Zero-Shot Multilingual Spoken Keyword Recognition
Figure 3 for Language-Universal Speech Attributes Modeling for Zero-Shot Multilingual Spoken Keyword Recognition
Figure 4 for Language-Universal Speech Attributes Modeling for Zero-Shot Multilingual Spoken Keyword Recognition
Viaarxiv icon

Boosting End-to-End Multilingual Phoneme Recognition through Exploiting Universal Speech Attributes Constraints

Add code
Sep 16, 2023
Figure 1 for Boosting End-to-End Multilingual Phoneme Recognition through Exploiting Universal Speech Attributes Constraints
Figure 2 for Boosting End-to-End Multilingual Phoneme Recognition through Exploiting Universal Speech Attributes Constraints
Figure 3 for Boosting End-to-End Multilingual Phoneme Recognition through Exploiting Universal Speech Attributes Constraints
Figure 4 for Boosting End-to-End Multilingual Phoneme Recognition through Exploiting Universal Speech Attributes Constraints
Viaarxiv icon

Cold Diffusion for Speech Enhancement

Add code
Nov 04, 2022
Viaarxiv icon

Improvements to Embedding-Matching Acoustic-to-Word ASR Using Multiple-Hypothesis Pronunciation-Based Embeddings

Add code
Oct 30, 2022
Viaarxiv icon

A Study of Low-Resource Speech Commands Recognition based on Adversarial Reprogramming

Add code
Oct 08, 2021
Figure 1 for A Study of Low-Resource Speech Commands Recognition based on Adversarial Reprogramming
Figure 2 for A Study of Low-Resource Speech Commands Recognition based on Adversarial Reprogramming
Figure 3 for A Study of Low-Resource Speech Commands Recognition based on Adversarial Reprogramming
Figure 4 for A Study of Low-Resource Speech Commands Recognition based on Adversarial Reprogramming
Viaarxiv icon