Chengzhu Yu

Stutter-TTS: Controlled Synthesis and Improved Recognition of Stuttered Speech

Nov 04, 2022
Via arXiv

Peking Opera Synthesis via Duration Informed Attention Network

Aug 07, 2020
Via arXiv

Synthesising Expressiveness in Peking Opera via Duration Informed Attention Network

Dec 27, 2019
Via arXiv

Learning Singing From Speech

Dec 20, 2019
Via arXiv

PitchNet: Unsupervised Singing Voice Conversion with Pitch Adversarial Network

Dec 04, 2019
Via arXiv

Minimum Bayes Risk Training of RNN-Transducer for End-to-End Speech Recognition

Nov 28, 2019
Via arXiv

DurIAN: Duration Informed Attention Network For Multimodal Synthesis

Sep 05, 2019
Via arXiv

Unsupervised Speech Recognition via Segmental Empirical Output Distribution Matching

Dec 23, 2018
Via arXiv

UTD-CRSS Systems for 2016 NIST Speaker Recognition Evaluation

Oct 24, 2016
Via arXiv