Picture for Slava Shechtman

Slava Shechtman

Continuous Speech Synthesis using per-token Latent Diffusion

Add code
Oct 21, 2024
Viaarxiv icon

Low Bitrate High-Quality RVQGAN-based Discrete Speech Tokenizer

Add code
Oct 10, 2024
Figure 1 for Low Bitrate High-Quality RVQGAN-based Discrete Speech Tokenizer
Figure 2 for Low Bitrate High-Quality RVQGAN-based Discrete Speech Tokenizer
Figure 3 for Low Bitrate High-Quality RVQGAN-based Discrete Speech Tokenizer
Figure 4 for Low Bitrate High-Quality RVQGAN-based Discrete Speech Tokenizer
Viaarxiv icon

A Neural TTS System with Parallel Prosody Transfer from Unseen Speakers

Add code
Sep 20, 2023
Viaarxiv icon

Speak While You Think: Streaming Speech Synthesis During Text Generation

Add code
Sep 20, 2023
Figure 1 for Speak While You Think: Streaming Speech Synthesis During Text Generation
Figure 2 for Speak While You Think: Streaming Speech Synthesis During Text Generation
Figure 3 for Speak While You Think: Streaming Speech Synthesis During Text Generation
Figure 4 for Speak While You Think: Streaming Speech Synthesis During Text Generation
Viaarxiv icon

Transplantation of Conversational Speaking Style with Interjections in Sequence-to-Sequence Speech Synthesis

Add code
Jul 25, 2022
Figure 1 for Transplantation of Conversational Speaking Style with Interjections in Sequence-to-Sequence Speech Synthesis
Figure 2 for Transplantation of Conversational Speaking Style with Interjections in Sequence-to-Sequence Speech Synthesis
Figure 3 for Transplantation of Conversational Speaking Style with Interjections in Sequence-to-Sequence Speech Synthesis
Figure 4 for Transplantation of Conversational Speaking Style with Interjections in Sequence-to-Sequence Speech Synthesis
Viaarxiv icon

Supervised and Unsupervised Approaches for Controlling Narrow Lexical Focus in Sequence-to-Sequence Speech Synthesis

Add code
Jan 25, 2021
Figure 1 for Supervised and Unsupervised Approaches for Controlling Narrow Lexical Focus in Sequence-to-Sequence Speech Synthesis
Figure 2 for Supervised and Unsupervised Approaches for Controlling Narrow Lexical Focus in Sequence-to-Sequence Speech Synthesis
Figure 3 for Supervised and Unsupervised Approaches for Controlling Narrow Lexical Focus in Sequence-to-Sequence Speech Synthesis
Figure 4 for Supervised and Unsupervised Approaches for Controlling Narrow Lexical Focus in Sequence-to-Sequence Speech Synthesis
Viaarxiv icon