Picture for Eng Siong Chng

Eng Siong Chng

NTU Speechlab LLM-Based Multilingual ASR System for Interspeech MLC-SLM Challenge 2025

Add code
Jun 16, 2025
Viaarxiv icon

Bi-directional Context-Enhanced Speech Large Language Models for Multilingual Conversational ASR

Add code
Jun 16, 2025
Viaarxiv icon

A correlation-permutation approach for speech-music encoders model merging

Add code
Jun 13, 2025
Viaarxiv icon

Speechless: Speech Instruction Training Without Speech for Low Resource Languages

Add code
May 23, 2025
Viaarxiv icon

EASY: Emotion-aware Speaker Anonymization via Factorized Distillation

Add code
May 21, 2025
Viaarxiv icon

Distilling a speech and music encoder with task arithmetic

Add code
May 19, 2025
Viaarxiv icon

Multi-band Frequency Reconstruction for Neural Psychoacoustic Coding

Add code
May 12, 2025
Viaarxiv icon

UniArray: Unified Spectral-Spatial Modeling for Array-Geometry-Agnostic Speech Separation

Add code
Mar 07, 2025
Viaarxiv icon

Speech Enhancement Using Continuous Embeddings of Neural Audio Codec

Add code
Feb 22, 2025
Viaarxiv icon

Audio Large Language Models Can Be Descriptive Speech Quality Evaluators

Add code
Jan 27, 2025
Viaarxiv icon