Picture for Karen Livescu

Karen Livescu

Shammie

MoshiRAG: Asynchronous Knowledge Retrieval for Full-Duplex Speech Language Models

Add code
Apr 14, 2026
Viaarxiv icon

The ML-SUPERB 2.0 Challenge: Towards Inclusive ASR Benchmarking for All Language Varieties

Add code
Sep 08, 2025
Figure 1 for The ML-SUPERB 2.0 Challenge: Towards Inclusive ASR Benchmarking for All Language Varieties
Figure 2 for The ML-SUPERB 2.0 Challenge: Towards Inclusive ASR Benchmarking for All Language Varieties
Figure 3 for The ML-SUPERB 2.0 Challenge: Towards Inclusive ASR Benchmarking for All Language Varieties
Figure 4 for The ML-SUPERB 2.0 Challenge: Towards Inclusive ASR Benchmarking for All Language Varieties
Viaarxiv icon

On The Landscape of Spoken Language Models: A Comprehensive Survey

Add code
Apr 11, 2025
Figure 1 for On The Landscape of Spoken Language Models: A Comprehensive Survey
Figure 2 for On The Landscape of Spoken Language Models: A Comprehensive Survey
Figure 3 for On The Landscape of Spoken Language Models: A Comprehensive Survey
Figure 4 for On The Landscape of Spoken Language Models: A Comprehensive Survey
Viaarxiv icon

CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition

Add code
Feb 03, 2025
Viaarxiv icon

Chunk-Distilled Language Modeling

Add code
Dec 31, 2024
Viaarxiv icon

SHuBERT: Self-Supervised Sign Language Representation Learning via Multi-Stream Cluster Prediction

Add code
Nov 25, 2024
Figure 1 for SHuBERT: Self-Supervised Sign Language Representation Learning via Multi-Stream Cluster Prediction
Figure 2 for SHuBERT: Self-Supervised Sign Language Representation Learning via Multi-Stream Cluster Prediction
Figure 3 for SHuBERT: Self-Supervised Sign Language Representation Learning via Multi-Stream Cluster Prediction
Figure 4 for SHuBERT: Self-Supervised Sign Language Representation Learning via Multi-Stream Cluster Prediction
Viaarxiv icon

Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks

Add code
Nov 08, 2024
Figure 1 for Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks
Figure 2 for Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks
Figure 3 for Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks
Figure 4 for Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks
Viaarxiv icon

Speech Recognition for Analysis of Police Radio Communication

Add code
Sep 17, 2024
Figure 1 for Speech Recognition for Analysis of Police Radio Communication
Figure 2 for Speech Recognition for Analysis of Police Radio Communication
Figure 3 for Speech Recognition for Analysis of Police Radio Communication
Figure 4 for Speech Recognition for Analysis of Police Radio Communication
Viaarxiv icon

Approaching Deep Learning through the Spectral Dynamics of Weights

Add code
Aug 21, 2024
Figure 1 for Approaching Deep Learning through the Spectral Dynamics of Weights
Figure 2 for Approaching Deep Learning through the Spectral Dynamics of Weights
Figure 3 for Approaching Deep Learning through the Spectral Dynamics of Weights
Figure 4 for Approaching Deep Learning through the Spectral Dynamics of Weights
Viaarxiv icon

Towards Robust Speech Representation Learning for Thousands of Languages

Add code
Jul 02, 2024
Viaarxiv icon