Picture for Edresson Casanova

Edresson Casanova

Low Frame-rate Speech Codec: a Codec Designed for Fast High-quality Speech LLM Training and Inference

Add code
Sep 18, 2024
Viaarxiv icon

XTTS: a Massively Multilingual Zero-Shot Text-to-Speech Model

Add code
Jun 07, 2024
Viaarxiv icon

MLAAD: The Multi-Language Audio Anti-Spoofing Dataset

Add code
Jan 17, 2024
Viaarxiv icon

CML-TTS A Multilingual Dataset for Speech Synthesis in Low-Resource Languages

Add code
Jun 16, 2023
Viaarxiv icon

Evaluation of Speech Representations for MOS prediction

Add code
Jun 16, 2023
Viaarxiv icon

Evaluating OpenAI's Whisper ASR for Punctuation Prediction and Topic Modeling of life histories of the Museum of the Person

Add code
May 26, 2023
Figure 1 for Evaluating OpenAI's Whisper ASR for Punctuation Prediction and Topic Modeling of life histories of the Museum of the Person
Figure 2 for Evaluating OpenAI's Whisper ASR for Punctuation Prediction and Topic Modeling of life histories of the Museum of the Person
Viaarxiv icon

Interpretability Analysis of Deep Models for COVID-19 Detection

Add code
Nov 25, 2022
Viaarxiv icon

BibleTTS: a large, high-fidelity, multilingual, and uniquely African speech corpus

Add code
Jul 07, 2022
Figure 1 for BibleTTS: a large, high-fidelity, multilingual, and uniquely African speech corpus
Figure 2 for BibleTTS: a large, high-fidelity, multilingual, and uniquely African speech corpus
Figure 3 for BibleTTS: a large, high-fidelity, multilingual, and uniquely African speech corpus
Figure 4 for BibleTTS: a large, high-fidelity, multilingual, and uniquely African speech corpus
Viaarxiv icon

A single speaker is almost all you need for automatic speech recognition

Add code
Mar 29, 2022
Figure 1 for A single speaker is almost all you need for automatic speech recognition
Figure 2 for A single speaker is almost all you need for automatic speech recognition
Figure 3 for A single speaker is almost all you need for automatic speech recognition
Viaarxiv icon

YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone

Add code
Dec 04, 2021
Figure 1 for YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
Figure 2 for YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
Figure 3 for YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
Figure 4 for YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
Viaarxiv icon