Picture for Takenori Yoshimura

Takenori Yoshimura

Embedding a Differentiable Mel-cepstral Synthesis Filter to a Neural Speech Synthesis System

Add code
Nov 21, 2022
Viaarxiv icon

ESPnet2-TTS: Extending the Edge of TTS Research

Add code
Oct 15, 2021
Figure 1 for ESPnet2-TTS: Extending the Edge of TTS Research
Figure 2 for ESPnet2-TTS: Extending the Edge of TTS Research
Figure 3 for ESPnet2-TTS: Extending the Edge of TTS Research
Figure 4 for ESPnet2-TTS: Extending the Edge of TTS Research
Viaarxiv icon

Neural Sequence-to-Sequence Speech Synthesis Using a Hidden Semi-Markov Model Based Structured Attention Mechanism

Add code
Aug 31, 2021
Figure 1 for Neural Sequence-to-Sequence Speech Synthesis Using a Hidden Semi-Markov Model Based Structured Attention Mechanism
Figure 2 for Neural Sequence-to-Sequence Speech Synthesis Using a Hidden Semi-Markov Model Based Structured Attention Mechanism
Figure 3 for Neural Sequence-to-Sequence Speech Synthesis Using a Hidden Semi-Markov Model Based Structured Attention Mechanism
Viaarxiv icon

End-to-End Automatic Speech Recognition Integrated With CTC-Based Voice Activity Detection

Add code
Feb 14, 2020
Figure 1 for End-to-End Automatic Speech Recognition Integrated With CTC-Based Voice Activity Detection
Figure 2 for End-to-End Automatic Speech Recognition Integrated With CTC-Based Voice Activity Detection
Figure 3 for End-to-End Automatic Speech Recognition Integrated With CTC-Based Voice Activity Detection
Figure 4 for End-to-End Automatic Speech Recognition Integrated With CTC-Based Voice Activity Detection
Viaarxiv icon

ESPnet-TTS: Unified, Reproducible, and Integratable Open Source End-to-End Text-to-Speech Toolkit

Add code
Oct 24, 2019
Figure 1 for ESPnet-TTS: Unified, Reproducible, and Integratable Open Source End-to-End Text-to-Speech Toolkit
Figure 2 for ESPnet-TTS: Unified, Reproducible, and Integratable Open Source End-to-End Text-to-Speech Toolkit
Figure 3 for ESPnet-TTS: Unified, Reproducible, and Integratable Open Source End-to-End Text-to-Speech Toolkit
Figure 4 for ESPnet-TTS: Unified, Reproducible, and Integratable Open Source End-to-End Text-to-Speech Toolkit
Viaarxiv icon

A Comparative Study on Transformer vs RNN in Speech Applications

Add code
Sep 28, 2019
Figure 1 for A Comparative Study on Transformer vs RNN in Speech Applications
Figure 2 for A Comparative Study on Transformer vs RNN in Speech Applications
Figure 3 for A Comparative Study on Transformer vs RNN in Speech Applications
Figure 4 for A Comparative Study on Transformer vs RNN in Speech Applications
Viaarxiv icon