Vikas Joshi

Building English ASR model with regional language support

Mar 10, 2025

Addressing speaker gender bias in large scale speech translation systems

Jan 10, 2025

Streaming Bilingual End-to-End ASR model using Attention over Multiple Softmax

Jan 22, 2024

Can AI Put Gamma-Ray Astrophysicists Out of a Job?

Apr 04, 2023

WavFT: Acoustic model finetuning with labelled and unlabelled data

Apr 01, 2022

Transfer Learning Approaches for Streaming End-to-End Speech Recognition System

Aug 17, 2020

Learning not to Discriminate: Task Agnostic Learning for Improving Monolingual and Code-switched Speech Recognition

Jun 09, 2020

Learning to Recognize Code-switched Speech Without Forgetting Monolingual Speech Recognition

Jun 01, 2020

Modified SPLICE and its Extension to Non-Stereo Data for Noise Robust Speech Recognition

Jul 15, 2013