Picture for Chin-Hui Lee

Chin-Hui Lee

An Explicit Consistency-Preserving Loss Function for Phase Reconstruction and Speech Enhancement

Add code
Sep 24, 2024
Viaarxiv icon

Exploring Audio-Visual Information Fusion for Sound Event Localization and Detection In Low-Resource Realistic Scenarios

Add code
Jun 21, 2024
Viaarxiv icon

Enhancing Voice Wake-Up for Dysarthria: Mandarin Dysarthria Speech Corpus Release and Customized System Design

Add code
Jun 14, 2024
Viaarxiv icon

Language-Universal Speech Attributes Modeling for Zero-Shot Multilingual Spoken Keyword Recognition

Add code
Jun 04, 2024
Viaarxiv icon

A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition

Add code
Mar 07, 2024
Viaarxiv icon

Bayesian adaptive learning to latent variables via Variational Bayes and Maximum a Posteriori

Add code
Jan 24, 2024
Viaarxiv icon

Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding with Sequence-to-Sequence Architecture

Add code
Sep 17, 2023
Viaarxiv icon

Boosting End-to-End Multilingual Phoneme Recognition through Exploiting Universal Speech Attributes Constraints

Add code
Sep 16, 2023
Viaarxiv icon

The Multimodal Information Based Speech Processing (MISP) 2023 Challenge: Audio-Visual Target Speaker Extraction

Add code
Sep 15, 2023
Viaarxiv icon

The USTC-NERCSLIP Systems for the CHiME-7 DASR Challenge

Add code
Aug 28, 2023
Viaarxiv icon