Stanislav Beliaev

Mixer-TTS: non-autoregressive, fast and compact text-to-speech model conditioned on language model embeddings

Oct 22, 2021
Via arXiv

TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction

Apr 19, 2021
Via arXiv

NeMo: a toolkit for building AI applications using Neural Modules

Sep 14, 2019
Via arXiv