Picture for Suyoun Kim

Suyoun Kim

Frozen Large Language Models Can Perceive Paralinguistic Aspects of Speech

Add code
Oct 02, 2024
Figure 1 for Frozen Large Language Models Can Perceive Paralinguistic Aspects of Speech
Figure 2 for Frozen Large Language Models Can Perceive Paralinguistic Aspects of Speech
Figure 3 for Frozen Large Language Models Can Perceive Paralinguistic Aspects of Speech
Figure 4 for Frozen Large Language Models Can Perceive Paralinguistic Aspects of Speech
Viaarxiv icon

PRoDeliberation: Parallel Robust Deliberation for End-to-End Spoken Language Understanding

Add code
Jun 12, 2024
Figure 1 for PRoDeliberation: Parallel Robust Deliberation for End-to-End Spoken Language Understanding
Figure 2 for PRoDeliberation: Parallel Robust Deliberation for End-to-End Spoken Language Understanding
Figure 3 for PRoDeliberation: Parallel Robust Deliberation for End-to-End Spoken Language Understanding
Figure 4 for PRoDeliberation: Parallel Robust Deliberation for End-to-End Spoken Language Understanding
Viaarxiv icon

Augmenting text for spoken language understanding with Large Language Models

Add code
Sep 17, 2023
Figure 1 for Augmenting text for spoken language understanding with Large Language Models
Figure 2 for Augmenting text for spoken language understanding with Large Language Models
Figure 3 for Augmenting text for spoken language understanding with Large Language Models
Figure 4 for Augmenting text for spoken language understanding with Large Language Models
Viaarxiv icon

Modality Confidence Aware Training for Robust End-to-End Spoken Language Understanding

Add code
Jul 22, 2023
Viaarxiv icon

Introducing Semantics into Speech Encoders

Add code
Nov 15, 2022
Viaarxiv icon

Joint Audio/Text Training for Transformer Rescorer of Streaming Speech Recognition

Add code
Oct 31, 2022
Viaarxiv icon

Deliberation Model for On-Device Spoken Language Understanding

Add code
Apr 04, 2022
Figure 1 for Deliberation Model for On-Device Spoken Language Understanding
Figure 2 for Deliberation Model for On-Device Spoken Language Understanding
Figure 3 for Deliberation Model for On-Device Spoken Language Understanding
Figure 4 for Deliberation Model for On-Device Spoken Language Understanding
Viaarxiv icon

Evaluating User Perception of Speech Recognition System Quality with Semantic Distance Metric

Add code
Oct 11, 2021
Figure 1 for Evaluating User Perception of Speech Recognition System Quality with Semantic Distance Metric
Figure 2 for Evaluating User Perception of Speech Recognition System Quality with Semantic Distance Metric
Figure 3 for Evaluating User Perception of Speech Recognition System Quality with Semantic Distance Metric
Figure 4 for Evaluating User Perception of Speech Recognition System Quality with Semantic Distance Metric
Viaarxiv icon

Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion

Add code
Apr 05, 2021
Figure 1 for Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion
Figure 2 for Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion
Figure 3 for Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion
Figure 4 for Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion
Viaarxiv icon

Semantic Distance: A New Metric for ASR Performance Analysis Towards Spoken Language Understanding

Add code
Apr 05, 2021
Figure 1 for Semantic Distance: A New Metric for ASR Performance Analysis Towards Spoken Language Understanding
Figure 2 for Semantic Distance: A New Metric for ASR Performance Analysis Towards Spoken Language Understanding
Figure 3 for Semantic Distance: A New Metric for ASR Performance Analysis Towards Spoken Language Understanding
Figure 4 for Semantic Distance: A New Metric for ASR Performance Analysis Towards Spoken Language Understanding
Viaarxiv icon