Picture for Hao Yen

Hao Yen

Efficient Long-Form Speech Recognition for General Speech In-Context Learning

Add code
Sep 29, 2024
Figure 1 for Efficient Long-Form Speech Recognition for General Speech In-Context Learning
Figure 2 for Efficient Long-Form Speech Recognition for General Speech In-Context Learning
Figure 3 for Efficient Long-Form Speech Recognition for General Speech In-Context Learning
Figure 4 for Efficient Long-Form Speech Recognition for General Speech In-Context Learning
Viaarxiv icon

An Explicit Consistency-Preserving Loss Function for Phase Reconstruction and Speech Enhancement

Add code
Sep 24, 2024
Viaarxiv icon

Language-Universal Speech Attributes Modeling for Zero-Shot Multilingual Spoken Keyword Recognition

Add code
Jun 04, 2024
Viaarxiv icon

Boosting End-to-End Multilingual Phoneme Recognition through Exploiting Universal Speech Attributes Constraints

Add code
Sep 16, 2023
Viaarxiv icon

Cold Diffusion for Speech Enhancement

Add code
Nov 04, 2022
Viaarxiv icon

Improvements to Embedding-Matching Acoustic-to-Word ASR Using Multiple-Hypothesis Pronunciation-Based Embeddings

Add code
Oct 30, 2022
Viaarxiv icon

A Study of Low-Resource Speech Commands Recognition based on Adversarial Reprogramming

Add code
Oct 08, 2021
Figure 1 for A Study of Low-Resource Speech Commands Recognition based on Adversarial Reprogramming
Figure 2 for A Study of Low-Resource Speech Commands Recognition based on Adversarial Reprogramming
Figure 3 for A Study of Low-Resource Speech Commands Recognition based on Adversarial Reprogramming
Figure 4 for A Study of Low-Resource Speech Commands Recognition based on Adversarial Reprogramming
Viaarxiv icon