Picture for Thilo Koehler

Thilo Koehler

Ultra-lightweight Neural Differential DSP Vocoder For High Quality Speech Synthesis

Add code
Jan 19, 2024
Viaarxiv icon

Multi-rate attention architecture for fast streamable Text-to-speech spectrum modeling

Add code
Apr 01, 2021
Figure 1 for Multi-rate attention architecture for fast streamable Text-to-speech spectrum modeling
Figure 2 for Multi-rate attention architecture for fast streamable Text-to-speech spectrum modeling
Figure 3 for Multi-rate attention architecture for fast streamable Text-to-speech spectrum modeling
Figure 4 for Multi-rate attention architecture for fast streamable Text-to-speech spectrum modeling
Viaarxiv icon

FBWave: Efficient and Scalable Neural Vocoders for Streaming Text-To-Speech on the Edge

Add code
Nov 25, 2020
Figure 1 for FBWave: Efficient and Scalable Neural Vocoders for Streaming Text-To-Speech on the Edge
Figure 2 for FBWave: Efficient and Scalable Neural Vocoders for Streaming Text-To-Speech on the Edge
Figure 3 for FBWave: Efficient and Scalable Neural Vocoders for Streaming Text-To-Speech on the Edge
Figure 4 for FBWave: Efficient and Scalable Neural Vocoders for Streaming Text-To-Speech on the Edge
Viaarxiv icon

G2G: TTS-Driven Pronunciation Learning for Graphemic Hybrid ASR

Add code
Oct 22, 2019
Figure 1 for G2G: TTS-Driven Pronunciation Learning for Graphemic Hybrid ASR
Figure 2 for G2G: TTS-Driven Pronunciation Learning for Graphemic Hybrid ASR
Figure 3 for G2G: TTS-Driven Pronunciation Learning for Graphemic Hybrid ASR
Figure 4 for G2G: TTS-Driven Pronunciation Learning for Graphemic Hybrid ASR
Viaarxiv icon