Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:SNIPER Training: Variable Sparsity Rate Training For Text-To-Speech

Nov 14, 2022

Perry Lam, Huayun Zhang, Nancy F. Chen, Berrak Sisman, Dorien Herremans

Figure 1 for SNIPER Training: Variable Sparsity Rate Training For Text-To-Speech

Figure 2 for SNIPER Training: Variable Sparsity Rate Training For Text-To-Speech

Figure 3 for SNIPER Training: Variable Sparsity Rate Training For Text-To-Speech

Figure 4 for SNIPER Training: Variable Sparsity Rate Training For Text-To-Speech

Share this with someone who'll enjoy it:

Abstract:Text-to-speech (TTS) models have achieved remarkable naturalness in recent years, yet like most deep neural models, they have more parameters than necessary. Sparse TTS models can improve on dense models via pruning and extra retraining, or converge faster than dense models with some performance loss. Inspired by these results, we propose training TTS models using a decaying sparsity rate, i.e. a high initial sparsity to accelerate training first, followed by a progressive rate reduction to obtain better eventual performance. This decremental approach differs from current methods of incrementing sparsity to a desired target, which costs significantly more time than dense training. We call our method SNIPER training: Single-shot Initialization Pruning Evolving-Rate training. Our experiments on FastSpeech2 show that although we were only able to obtain better losses in the first few epochs before being overtaken by the baseline, the final SNIPER-trained models beat constant-sparsity models and pip dense models in performance.

View paper on

Share this with someone who'll enjoy it:

Title:SNIPER Training: Variable Sparsity Rate Training For Text-To-Speech

Paper and Code