Kshitiz Kumar

Bilingual Streaming ASR with Grapheme units and Auxiliary Monolingual Loss

Aug 11, 2023
Via arXiv

Maximizing Audio Event Detection Model Performance on Small Datasets Through Knowledge Transfer, Data Augmentation, And Pretraining: An Ablation Study

Feb 07, 2022
Via arXiv

Sequence-level Confidence Classifier for ASR Utterance Accuracy and Application to Acoustic Models

Jun 30, 2021
Via arXiv

Transfer Learning Approaches for Streaming End-to-End Speech Recognition System

Aug 17, 2020
Via arXiv

Speaker Adaptation for End-to-End CTC Models

Jan 04, 2019
Via arXiv