Picture for Toshiyuki Kumakura

Toshiyuki Kumakura

SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization

Add code
May 16, 2022
Figure 1 for SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization
Figure 2 for SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization
Figure 3 for SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization
Figure 4 for SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization
Viaarxiv icon

Polyphone disambiguation and accent prediction using pre-trained language models in Japanese TTS front-end

Add code
Jan 24, 2022
Figure 1 for Polyphone disambiguation and accent prediction using pre-trained language models in Japanese TTS front-end
Figure 2 for Polyphone disambiguation and accent prediction using pre-trained language models in Japanese TTS front-end
Figure 3 for Polyphone disambiguation and accent prediction using pre-trained language models in Japanese TTS front-end
Figure 4 for Polyphone disambiguation and accent prediction using pre-trained language models in Japanese TTS front-end
Viaarxiv icon

Towards Online End-to-end Transformer Automatic Speech Recognition

Add code
Oct 25, 2019
Figure 1 for Towards Online End-to-end Transformer Automatic Speech Recognition
Figure 2 for Towards Online End-to-end Transformer Automatic Speech Recognition
Figure 3 for Towards Online End-to-end Transformer Automatic Speech Recognition
Figure 4 for Towards Online End-to-end Transformer Automatic Speech Recognition
Viaarxiv icon

Transformer ASR with Contextual Block Processing

Add code
Oct 16, 2019
Figure 1 for Transformer ASR with Contextual Block Processing
Figure 2 for Transformer ASR with Contextual Block Processing
Figure 3 for Transformer ASR with Contextual Block Processing
Figure 4 for Transformer ASR with Contextual Block Processing
Viaarxiv icon

End-to-end Adaptation with Backpropagation through WFST for On-device Speech Recognition System

Add code
May 17, 2019
Figure 1 for End-to-end Adaptation with Backpropagation through WFST for On-device Speech Recognition System
Figure 2 for End-to-end Adaptation with Backpropagation through WFST for On-device Speech Recognition System
Figure 3 for End-to-end Adaptation with Backpropagation through WFST for On-device Speech Recognition System
Figure 4 for End-to-end Adaptation with Backpropagation through WFST for On-device Speech Recognition System
Viaarxiv icon