This paper describes a novel neural network-based speech generation model for learning prosodic representations. The problem of representation learning is formulated according to the information bottleneck (IB) principle. A modified VQ-VAE quantization layer is incorporated into the speech generation model to control the IB capacity and to adjust the trade-off between the reconstruction power and the disentanglement capability of the learned representation. The proposed model learns word-level prosodic representations from speech data. With an appropriately tuned IB capacity, the learned representations are not only adequate for reconstructing the original speech but can also be used to transfer the prosody onto different textual content. Extensive objective and subjective evaluation results are presented to demonstrate the effect of IB capacity control, as well as the effectiveness and potential uses of the learned prosodic representation in controllable neural speech generation.
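
The abstract does not specify how the VQ-VAE quantization layer is modified, so the following is only an illustrative sketch of a standard VQ-VAE quantizer (in PyTorch, with hypothetical names such as `VectorQuantizer`, `num_codes`, and `beta`). It shows one common way such a layer bounds the IB capacity: each quantized vector carries at most log2(num_codes) bits, so the codebook size acts as a coarse capacity knob.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class VectorQuantizer(nn.Module):
    """Minimal VQ-VAE quantization layer (illustrative sketch).

    The codebook size `num_codes` caps the information carried per
    quantized vector at log2(num_codes) bits, which is one simple way
    to control the information bottleneck capacity.
    """

    def __init__(self, num_codes: int = 256, code_dim: int = 64, beta: float = 0.25):
        super().__init__()
        self.codebook = nn.Embedding(num_codes, code_dim)
        self.codebook.weight.data.uniform_(-1.0 / num_codes, 1.0 / num_codes)
        self.beta = beta  # weight of the commitment loss

    def forward(self, z_e: torch.Tensor):
        # z_e: (batch, time, code_dim) continuous encoder outputs
        flat = z_e.reshape(-1, z_e.size(-1))                       # (B*T, D)
        # Squared Euclidean distance from each vector to every codebook entry
        dist = (flat.pow(2).sum(1, keepdim=True)
                - 2 * flat @ self.codebook.weight.t()
                + self.codebook.weight.pow(2).sum(1))
        indices = dist.argmin(dim=1)                                # nearest code per vector
        z_q = self.codebook(indices).view_as(z_e)                   # quantized vectors

        # Codebook and commitment losses (stop-gradient via detach)
        codebook_loss = F.mse_loss(z_q, z_e.detach())
        commitment_loss = F.mse_loss(z_e, z_q.detach())
        vq_loss = codebook_loss + self.beta * commitment_loss

        # Straight-through estimator: gradients flow from z_q back to z_e
        z_q = z_e + (z_q - z_e).detach()
        return z_q, vq_loss, indices.view(z_e.shape[:-1])
```

In this sketch, shrinking `num_codes` (or `code_dim`) tightens the bottleneck, trading reconstruction fidelity for a more disentangled, content-independent representation; the paper's actual capacity-control mechanism may differ.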