Picture for Leda Sari

Leda Sari

CJST: CTC Compressor based Joint Speech and Text Training for Decoder-Only ASR

Add code
Nov 12, 2024
Viaarxiv icon

Frozen Large Language Models Can Perceive Paralinguistic Aspects of Speech

Add code
Oct 02, 2024
Figure 1 for Frozen Large Language Models Can Perceive Paralinguistic Aspects of Speech
Figure 2 for Frozen Large Language Models Can Perceive Paralinguistic Aspects of Speech
Figure 3 for Frozen Large Language Models Can Perceive Paralinguistic Aspects of Speech
Figure 4 for Frozen Large Language Models Can Perceive Paralinguistic Aspects of Speech
Viaarxiv icon

Dynamic ASR Pathways: An Adaptive Masking Approach Towards Efficient Pruning of A Multilingual ASR Model

Add code
Sep 22, 2023
Viaarxiv icon

Augmenting text for spoken language understanding with Large Language Models

Add code
Sep 17, 2023
Figure 1 for Augmenting text for spoken language understanding with Large Language Models
Figure 2 for Augmenting text for spoken language understanding with Large Language Models
Figure 3 for Augmenting text for spoken language understanding with Large Language Models
Figure 4 for Augmenting text for spoken language understanding with Large Language Models
Viaarxiv icon

Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale

Add code
Jun 23, 2023
Viaarxiv icon

Self-Supervised Representations for Singing Voice Conversion

Add code
Mar 21, 2023
Viaarxiv icon

Biased Self-supervised learning for ASR

Add code
Nov 04, 2022
Viaarxiv icon

Ego4D: Around the World in 3,000 Hours of Egocentric Video

Add code
Oct 13, 2021
Figure 1 for Ego4D: Around the World in 3,000 Hours of Egocentric Video
Figure 2 for Ego4D: Around the World in 3,000 Hours of Egocentric Video
Figure 3 for Ego4D: Around the World in 3,000 Hours of Egocentric Video
Figure 4 for Ego4D: Around the World in 3,000 Hours of Egocentric Video
Viaarxiv icon

Identify Speakers in Cocktail Parties with End-to-End Attention

Add code
May 22, 2020
Figure 1 for Identify Speakers in Cocktail Parties with End-to-End Attention
Figure 2 for Identify Speakers in Cocktail Parties with End-to-End Attention
Figure 3 for Identify Speakers in Cocktail Parties with End-to-End Attention
Figure 4 for Identify Speakers in Cocktail Parties with End-to-End Attention
Viaarxiv icon