Picture for Suwon Shon

Suwon Shon

Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks

Add code
Nov 08, 2024
Figure 1 for Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks
Figure 2 for Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks
Figure 3 for Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks
Figure 4 for Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks
Viaarxiv icon

Speech vs. Transcript: Does It Matter for Human Annotators in Speech Summarization?

Add code
Aug 12, 2024
Figure 1 for Speech vs. Transcript: Does It Matter for Human Annotators in Speech Summarization?
Figure 2 for Speech vs. Transcript: Does It Matter for Human Annotators in Speech Summarization?
Figure 3 for Speech vs. Transcript: Does It Matter for Human Annotators in Speech Summarization?
Figure 4 for Speech vs. Transcript: Does It Matter for Human Annotators in Speech Summarization?
Viaarxiv icon

On the Evaluation of Speech Foundation Models for Spoken Language Understanding

Add code
Jun 14, 2024
Viaarxiv icon

DiscreteSLU: A Large Language Model with Self-Supervised Discrete Speech Units for Spoken Language Understanding

Add code
Jun 13, 2024
Figure 1 for DiscreteSLU: A Large Language Model with Self-Supervised Discrete Speech Units for Spoken Language Understanding
Figure 2 for DiscreteSLU: A Large Language Model with Self-Supervised Discrete Speech Units for Spoken Language Understanding
Figure 3 for DiscreteSLU: A Large Language Model with Self-Supervised Discrete Speech Units for Spoken Language Understanding
Figure 4 for DiscreteSLU: A Large Language Model with Self-Supervised Discrete Speech Units for Spoken Language Understanding
Viaarxiv icon

Improving ASR Contextual Biasing with Guided Attention

Add code
Jan 16, 2024
Viaarxiv icon

Generative Context-aware Fine-tuning of Self-supervised Speech Models

Add code
Dec 15, 2023
Figure 1 for Generative Context-aware Fine-tuning of Self-supervised Speech Models
Figure 2 for Generative Context-aware Fine-tuning of Self-supervised Speech Models
Figure 3 for Generative Context-aware Fine-tuning of Self-supervised Speech Models
Figure 4 for Generative Context-aware Fine-tuning of Self-supervised Speech Models
Viaarxiv icon

A Comparative Study on E-Branchformer vs Conformer in Speech Recognition, Translation, and Understanding Tasks

Add code
May 18, 2023
Viaarxiv icon

SLUE Phase-2: A Benchmark Suite of Diverse Spoken Language Understanding Tasks

Add code
Dec 20, 2022
Viaarxiv icon

Context-aware Fine-tuning of Self-supervised Speech Models

Add code
Dec 16, 2022
Viaarxiv icon

On the Use of External Data for Spoken Named Entity Recognition

Add code
Dec 14, 2021
Figure 1 for On the Use of External Data for Spoken Named Entity Recognition
Figure 2 for On the Use of External Data for Spoken Named Entity Recognition
Figure 3 for On the Use of External Data for Spoken Named Entity Recognition
Figure 4 for On the Use of External Data for Spoken Named Entity Recognition
Viaarxiv icon