Picture for Guillermo Cámbara

Guillermo Cámbara

BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data

Add code
Feb 15, 2024
Figure 1 for BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data
Figure 2 for BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data
Figure 3 for BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data
Figure 4 for BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data
Viaarxiv icon

Data Augmentation for Low-Resource Quechua ASR Improvement

Add code
Jul 14, 2022
Figure 1 for Data Augmentation for Low-Resource Quechua ASR Improvement
Figure 2 for Data Augmentation for Low-Resource Quechua ASR Improvement
Figure 3 for Data Augmentation for Low-Resource Quechua ASR Improvement
Figure 4 for Data Augmentation for Low-Resource Quechua ASR Improvement
Viaarxiv icon

Voice Quality and Pitch Features in Transformer-Based Speech Recognition

Add code
Dec 21, 2021
Figure 1 for Voice Quality and Pitch Features in Transformer-Based Speech Recognition
Figure 2 for Voice Quality and Pitch Features in Transformer-Based Speech Recognition
Figure 3 for Voice Quality and Pitch Features in Transformer-Based Speech Recognition
Figure 4 for Voice Quality and Pitch Features in Transformer-Based Speech Recognition
Viaarxiv icon

English Accent Accuracy Analysis in a State-of-the-Art Automatic Speech Recognition System

Add code
May 09, 2021
Figure 1 for English Accent Accuracy Analysis in a State-of-the-Art Automatic Speech Recognition System
Figure 2 for English Accent Accuracy Analysis in a State-of-the-Art Automatic Speech Recognition System
Viaarxiv icon

Speech Enhancement for Wake-Up-Word detection in Voice Assistants

Add code
Jan 29, 2021
Figure 1 for Speech Enhancement for Wake-Up-Word detection in Voice Assistants
Figure 2 for Speech Enhancement for Wake-Up-Word detection in Voice Assistants
Figure 3 for Speech Enhancement for Wake-Up-Word detection in Voice Assistants
Figure 4 for Speech Enhancement for Wake-Up-Word detection in Voice Assistants
Viaarxiv icon

BCN2BRNO: ASR System Fusion for Albayzin 2020 Speech to Text Challenge

Add code
Jan 29, 2021
Figure 1 for BCN2BRNO: ASR System Fusion for Albayzin 2020 Speech to Text Challenge
Figure 2 for BCN2BRNO: ASR System Fusion for Albayzin 2020 Speech to Text Challenge
Figure 3 for BCN2BRNO: ASR System Fusion for Albayzin 2020 Speech to Text Challenge
Figure 4 for BCN2BRNO: ASR System Fusion for Albayzin 2020 Speech to Text Challenge
Viaarxiv icon

Convolutional Speech Recognition with Pitch and Voice Quality Features

Add code
Sep 02, 2020
Figure 1 for Convolutional Speech Recognition with Pitch and Voice Quality Features
Figure 2 for Convolutional Speech Recognition with Pitch and Voice Quality Features
Viaarxiv icon

Detection of speech events and speaker characteristics through photo-plethysmographic signal neural processing

Add code
Nov 12, 2019
Figure 1 for Detection of speech events and speaker characteristics through photo-plethysmographic signal neural processing
Figure 2 for Detection of speech events and speaker characteristics through photo-plethysmographic signal neural processing
Figure 3 for Detection of speech events and speaker characteristics through photo-plethysmographic signal neural processing
Viaarxiv icon