Picture for Iván Vallés-Pérez

Iván Vallés-Pérez

Enhancing the Stability of LLM-based Speech Generation Systems through Self-Supervised Representations

Add code
Feb 05, 2024
Viaarxiv icon

Empirical study of the modulus as activation function in computer vision applications

Add code
Jan 15, 2023
Viaarxiv icon

Stutter-TTS: Controlled Synthesis and Improved Recognition of Stuttered Speech

Add code
Nov 04, 2022
Viaarxiv icon

Approaching sales forecasting using recurrent neural networks and transformers

Add code
Apr 16, 2022
Figure 1 for Approaching sales forecasting using recurrent neural networks and transformers
Figure 2 for Approaching sales forecasting using recurrent neural networks and transformers
Figure 3 for Approaching sales forecasting using recurrent neural networks and transformers
Figure 4 for Approaching sales forecasting using recurrent neural networks and transformers
Viaarxiv icon

End-to-end Keyword Spotting using Xception-1d

Add code
Oct 09, 2021
Figure 1 for End-to-end Keyword Spotting using Xception-1d
Figure 2 for End-to-end Keyword Spotting using Xception-1d
Figure 3 for End-to-end Keyword Spotting using Xception-1d
Viaarxiv icon

Improving multi-speaker TTS prosody variance with a residual encoder and normalizing flows

Add code
Jun 10, 2021
Figure 1 for Improving multi-speaker TTS prosody variance with a residual encoder and normalizing flows
Figure 2 for Improving multi-speaker TTS prosody variance with a residual encoder and normalizing flows
Figure 3 for Improving multi-speaker TTS prosody variance with a residual encoder and normalizing flows
Figure 4 for Improving multi-speaker TTS prosody variance with a residual encoder and normalizing flows
Viaarxiv icon