Picture for Kwanghee Choi

Kwanghee Choi

Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks

Add code
Nov 08, 2024
Viaarxiv icon

ESPnet-EZ: Python-only ESPnet for Easy Fine-tuning and Integration

Add code
Sep 14, 2024
Viaarxiv icon

On the Effects of Heterogeneous Data Sources on Speech-to-Text Foundation Models

Add code
Jun 13, 2024
Viaarxiv icon

Self-Supervised Speech Representations are More Phonetic than Semantic

Add code
Jun 12, 2024
Figure 1 for Self-Supervised Speech Representations are More Phonetic than Semantic
Figure 2 for Self-Supervised Speech Representations are More Phonetic than Semantic
Figure 3 for Self-Supervised Speech Representations are More Phonetic than Semantic
Figure 4 for Self-Supervised Speech Representations are More Phonetic than Semantic
Viaarxiv icon

Wav2Gloss: Generating Interlinear Glossed Text from Speech

Add code
Mar 19, 2024
Viaarxiv icon

OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer

Add code
Jan 30, 2024
Figure 1 for OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer
Figure 2 for OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer
Figure 3 for OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer
Figure 4 for OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer
Viaarxiv icon

Understanding Probe Behaviors through Variational Bounds of Mutual Information

Add code
Dec 15, 2023
Viaarxiv icon

Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study

Add code
Sep 27, 2023
Figure 1 for Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study
Figure 2 for Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study
Figure 3 for Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study
Figure 4 for Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study
Viaarxiv icon

Speech Intelligibility Assessment of Dysarthric Speech by using Goodness of Pronunciation with Uncertainty Quantification

Add code
May 28, 2023
Viaarxiv icon

Unsupervised Speech Representation Pooling Using Vector Quantization

Add code
Apr 08, 2023
Viaarxiv icon