Bajibabu Bollepalli

Distribution augmentation for low-resource expressive text-to-speech

Feb 19, 2022

Formant Tracking Using Quasi-Closed Phase Forward-Backward Linear Prediction Analysis and Deep Neural Networks

Jan 05, 2022

Multi-Scale Spectrogram Modelling for Neural Text-to-Speech

Jun 29, 2021

GELP: GAN-Excited Linear Prediction for Speech Synthesis from Mel-spectrogram

Apr 10, 2019

Generative adversarial network-based glottal waveform model for statistical parametric speech synthesis

Mar 14, 2019

Waveform generation for text-to-speech synthesis using pitch-synchronous multi-scale generative adversarial networks

Oct 30, 2018

Speaking style adaptation in Text-To-Speech synthesis using Sequence-to-sequence models with attention

Oct 29, 2018

Speaker-independent raw waveform model for glottal excitation

Apr 25, 2018

Speech waveform synthesis from MFCC sequences with generative adversarial networks

Apr 03, 2018

DNN-based Speech Synthesis for Indian Languages from ASCII text

Aug 18, 2016