Picture for Vijay Ravi

Vijay Ravi

Non-uniform Speaker Disentanglement For Depression Detection From Raw Speech Signals

Add code
Jun 06, 2023
Viaarxiv icon

A Step Towards Preserving Speakers' Identity While Detecting Depression Via Speaker Disentanglement

Add code
Jun 29, 2022
Figure 1 for A Step Towards Preserving Speakers' Identity While Detecting Depression Via Speaker Disentanglement
Figure 2 for A Step Towards Preserving Speakers' Identity While Detecting Depression Via Speaker Disentanglement
Figure 3 for A Step Towards Preserving Speakers' Identity While Detecting Depression Via Speaker Disentanglement
Figure 4 for A Step Towards Preserving Speakers' Identity While Detecting Depression Via Speaker Disentanglement
Viaarxiv icon

Unsupervised Instance Discriminative Learning for Depression Detection from Speech Signals

Add code
Jun 27, 2022
Figure 1 for Unsupervised Instance Discriminative Learning for Depression Detection from Speech Signals
Figure 2 for Unsupervised Instance Discriminative Learning for Depression Detection from Speech Signals
Figure 3 for Unsupervised Instance Discriminative Learning for Depression Detection from Speech Signals
Figure 4 for Unsupervised Instance Discriminative Learning for Depression Detection from Speech Signals
Viaarxiv icon

Automatic Dialect Density Estimation for African American English

Add code
Apr 03, 2022
Figure 1 for Automatic Dialect Density Estimation for African American English
Figure 2 for Automatic Dialect Density Estimation for African American English
Figure 3 for Automatic Dialect Density Estimation for African American English
Figure 4 for Automatic Dialect Density Estimation for African American English
Viaarxiv icon

FrAUG: A Frame Rate Based Data Augmentation Method for Depression Detection from Speech Signals

Add code
Feb 11, 2022
Figure 1 for FrAUG: A Frame Rate Based Data Augmentation Method for Depression Detection from Speech Signals
Figure 2 for FrAUG: A Frame Rate Based Data Augmentation Method for Depression Detection from Speech Signals
Figure 3 for FrAUG: A Frame Rate Based Data Augmentation Method for Depression Detection from Speech Signals
Figure 4 for FrAUG: A Frame Rate Based Data Augmentation Method for Depression Detection from Speech Signals
Viaarxiv icon

Improving accuracy of rare words for RNN-Transducer through unigram shallow fusion

Add code
Nov 30, 2020
Figure 1 for Improving accuracy of rare words for RNN-Transducer through unigram shallow fusion
Figure 2 for Improving accuracy of rare words for RNN-Transducer through unigram shallow fusion
Figure 3 for Improving accuracy of rare words for RNN-Transducer through unigram shallow fusion
Figure 4 for Improving accuracy of rare words for RNN-Transducer through unigram shallow fusion
Viaarxiv icon

Variable frame rate-based data augmentation to handle speaking-style variability for automatic speaker verification

Add code
Aug 08, 2020
Figure 1 for Variable frame rate-based data augmentation to handle speaking-style variability for automatic speaker verification
Figure 2 for Variable frame rate-based data augmentation to handle speaking-style variability for automatic speaker verification
Figure 3 for Variable frame rate-based data augmentation to handle speaking-style variability for automatic speaker verification
Viaarxiv icon

Exploring the Use of an Unsupervised Autoregressive Model as a Shared Encoder for Text-Dependent Speaker Verification

Add code
Aug 08, 2020
Figure 1 for Exploring the Use of an Unsupervised Autoregressive Model as a Shared Encoder for Text-Dependent Speaker Verification
Figure 2 for Exploring the Use of an Unsupervised Autoregressive Model as a Shared Encoder for Text-Dependent Speaker Verification
Figure 3 for Exploring the Use of an Unsupervised Autoregressive Model as a Shared Encoder for Text-Dependent Speaker Verification
Viaarxiv icon