William Chen

Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks

Nov 08, 2024
Via arXiv

Findings of the IWSLT 2024 Evaluation Campaign

Nov 07, 2024
Via arXiv

ESPnet-Codec: Comprehensive Training and Evaluation of Neural Codecs for Audio, Music, and Speech

Sep 24, 2024
Via arXiv

ESPnet-EZ: Python-only ESPnet for Easy Fine-tuning and Integration

Sep 14, 2024
Via arXiv

CMU's IWSLT 2024 Simultaneous Speech Translation System

Aug 14, 2024
Via arXiv

Towards Robust Speech Representation Learning for Thousands of Languages

Jul 02, 2024
Via arXiv

Nollywood: Let's Go to the Movies!

Jul 02, 2024
Via arXiv

On the Evaluation of Speech Foundation Models for Spoken Language Understanding

Jun 14, 2024
Via arXiv

On the Effects of Heterogeneous Data Sources on Speech-to-Text Foundation Models

Jun 13, 2024
Via arXiv

ML-SUPERB 2.0: Benchmarking Multilingual Speech Models Across Modeling Constraints, Languages, and Datasets

Jun 12, 2024
Via arXiv