Prashant Sridhar

DiscreteSLU: A Large Language Model with Self-Supervised Discrete Speech Units for Spoken Language Understanding

Jun 13, 2024
Via arXiv

Improving ASR Contextual Biasing with Guided Attention

Jan 16, 2024
Via arXiv

Generative Context-aware Fine-tuning of Self-supervised Speech Models

Dec 15, 2023
Via arXiv

A Comparative Study on E-Branchformer vs Conformer in Speech Recognition, Translation, and Understanding Tasks

May 18, 2023
Via arXiv

Structured Pruning of Self-Supervised Pre-trained Models for Speech Recognition and Understanding

Feb 27, 2023
Via arXiv

Context-aware Fine-tuning of Self-supervised Speech Models

Dec 16, 2022
Via arXiv

E-Branchformer: Branchformer with Enhanced merging for speech recognition

Sep 30, 2022
Via arXiv

Multi-mode Transformer Transducer with Stochastic Future Context

Jun 17, 2021
Via arXiv

Tuplemax Loss for Language Identification

Nov 29, 2018
Via arXiv

VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking

Oct 27, 2018
Via arXiv