Arnaud Joly

INRIA Saclay - Ile de France

BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data

Feb 15, 2024

Controllable Emphasis with zero data for text-to-speech

Jul 13, 2023

Simple and Effective Multi-sentence TTS with Expressive and Coherent Prosody

Jun 29, 2022

Distribution augmentation for low-resource expressive text-to-speech

Feb 19, 2022

Multi-Scale Spectrogram Modelling for Neural Text-to-Speech

Jun 29, 2021

A learned conditional prior for the VAE acoustic space of a TTS system

Jun 14, 2021

Prosodic Representation Learning and Contextual Sampling for Neural Text-to-Speech

Nov 04, 2020

Gradient tree boosting with random output projections for multi-label classification and multi-output regression

May 18, 2019

Exploiting random projections and sparsity with random forests and gradient boosting methods -- Application to multi-label and multi-output learning, random forest model compression and leveraging input sparsity

Apr 26, 2017

Simple connectome inference from partial correlation statistics in calcium imaging

Nov 18, 2014