Picture for Jia Qi Yip

Jia Qi Yip

Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks

Add code
Nov 08, 2024
Viaarxiv icon

Emotional Dimension Control in Language Model-Based Text-to-Speech: Spanning a Broad Spectrum of Human Emotions

Add code
Sep 25, 2024
Viaarxiv icon

ESPnet-Codec: Comprehensive Training and Evaluation of Neural Codecs for Audio, Music, and Speech

Add code
Sep 24, 2024
Viaarxiv icon

Continual Learning Optimizations for Auto-regressive Decoder of Multilingual ASR systems

Add code
Jul 04, 2024
Viaarxiv icon

Towards Audio Codec-based Speech Separation

Add code
Jun 18, 2024
Viaarxiv icon

Phonetic Enhanced Language Modeling for Text-to-Speech Synthesis

Add code
Jun 04, 2024
Viaarxiv icon

Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Automatic Speaker Verification

Add code
Sep 26, 2023
Viaarxiv icon

SPGM: Prioritizing Local Features for enhanced speech separation performance

Add code
Sep 22, 2023
Viaarxiv icon

Analysis of Speech Separation Performance Degradation on Emotional Speech Mixtures

Add code
Sep 14, 2023
Viaarxiv icon

Codec Data Augmentation for Time-domain Heart Sound Classification

Add code
Sep 14, 2023
Viaarxiv icon