Eng Siong Chng

Omni-Captioner: Data Pipeline, Models, and Benchmark for Omni Detailed Perception

Oct 14, 2025

Mind-Paced Speaking: A Dual-Brain Approach to Real-Time Reasoning in Spoken Language Models

Oct 10, 2025

Improving Synthetic Data Training for Contextual Biasing Models with a Keyword-Aware Cost Function

Sep 11, 2025

Zero-shot Context Biasing with Trie-based Decoding using Synthetic Multi-Pronunciation

Aug 25, 2025

NTU Speechlab LLM-Based Multilingual ASR System for Interspeech MLC-SLM Challenge 2025

Jun 16, 2025

Bi-directional Context-Enhanced Speech Large Language Models for Multilingual Conversational ASR

Jun 16, 2025

A correlation-permutation approach for speech-music encoders model merging

Jun 13, 2025

Speechless: Speech Instruction Training Without Speech for Low Resource Languages

May 23, 2025

EASY: Emotion-aware Speaker Anonymization via Factorized Distillation

May 21, 2025

Distilling a speech and music encoder with task arithmetic

May 19, 2025