Hao Yen

A Bottom-up Framework with Language-universal Speech Attribute Modeling for Syllable-based ASR

Sep 09, 2025

An Investigation on Combining Geometry and Consistency Constraints into Phase Estimation for Speech Enhancement

Jul 02, 2025

Efficient Long-Form Speech Recognition for General Speech In-Context Learning

Sep 29, 2024

An Explicit Consistency-Preserving Loss Function for Phase Reconstruction and Speech Enhancement

Sep 24, 2024

Language-Universal Speech Attributes Modeling for Zero-Shot Multilingual Spoken Keyword Recognition

Jun 04, 2024

Boosting End-to-End Multilingual Phoneme Recognition through Exploiting Universal Speech Attributes Constraints

Sep 16, 2023

Cold Diffusion for Speech Enhancement

Nov 04, 2022

Improvements to Embedding-Matching Acoustic-to-Word ASR Using Multiple-Hypothesis Pronunciation-Based Embeddings

Oct 30, 2022

A Study of Low-Resource Speech Commands Recognition based on Adversarial Reprogramming

Oct 08, 2021