Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:A Systematic Exploration of Joint-training for Singing Voice Synthesis

Aug 05, 2023

Yuning Wu, Yifeng Yu, Jiatong Shi, Tao Qian, Qin Jin

Figure 1 for A Systematic Exploration of Joint-training for Singing Voice Synthesis

Figure 2 for A Systematic Exploration of Joint-training for Singing Voice Synthesis

Figure 3 for A Systematic Exploration of Joint-training for Singing Voice Synthesis

Figure 4 for A Systematic Exploration of Joint-training for Singing Voice Synthesis

Share this with someone who'll enjoy it:

Abstract:There has been a growing interest in using end-to-end acoustic models for singing voice synthesis (SVS). Typically, these models require an additional vocoder to transform the generated acoustic features into the final waveform. However, since the acoustic model and the vocoder are not jointly optimized, a gap can exist between the two models, leading to suboptimal performance. Although a similar problem has been addressed in the TTS systems by joint-training or by replacing acoustic features with a latent representation, adopting corresponding approaches to SVS is not an easy task. How to improve the joint-training of SVS systems has not been well explored. In this paper, we conduct a systematic investigation of how to better perform a joint-training of an acoustic model and a vocoder for SVS. We carry out extensive experiments and demonstrate that our joint-training strategy outperforms baselines, achieving more stable performance across different datasets while also increasing the interpretability of the entire framework.

View paper on

Share this with someone who'll enjoy it:

Title:A Systematic Exploration of Joint-training for Singing Voice Synthesis

Paper and Code