Hyoungmin Park

Prosodic Clustering for Phoneme-level Prosody Control in End-to-End Speech Synthesis

Nov 19, 2021
Via arXiv

Word-Level Style Control for Expressive, Non-attentive Speech Synthesis

Nov 19, 2021
Via arXiv

Improved Prosodic Clustering for Multispeaker and Speaker-independent Phoneme-level Prosody Control

Nov 19, 2021
Via arXiv

Rapping-Singing Voice Synthesis based on Phoneme-level Prosody Control

Nov 17, 2021
Via arXiv

Cross-lingual Low Resource Speaker Adaptation Using Phonological Features

Nov 17, 2021
Via arXiv

High Quality Streaming Speech Synthesis with Low, Sentence-Length-Independent Latency

Nov 17, 2021
Via arXiv