Picture for Ohsung Kwon

Ohsung Kwon

HyperCLOVA X Technical Report

Add code
Apr 13, 2024
Viaarxiv icon

Unified Speech-Text Pretraining for Spoken Dialog Modeling

Add code
Feb 08, 2024
Figure 1 for Unified Speech-Text Pretraining for Spoken Dialog Modeling
Figure 2 for Unified Speech-Text Pretraining for Spoken Dialog Modeling
Figure 3 for Unified Speech-Text Pretraining for Spoken Dialog Modeling
Figure 4 for Unified Speech-Text Pretraining for Spoken Dialog Modeling
Viaarxiv icon

Language Model-Based Emotion Prediction Methods for Emotional Speech Synthesis Systems

Add code
Jul 01, 2022
Figure 1 for Language Model-Based Emotion Prediction Methods for Emotional Speech Synthesis Systems
Figure 2 for Language Model-Based Emotion Prediction Methods for Emotional Speech Synthesis Systems
Figure 3 for Language Model-Based Emotion Prediction Methods for Emotional Speech Synthesis Systems
Figure 4 for Language Model-Based Emotion Prediction Methods for Emotional Speech Synthesis Systems
Viaarxiv icon

TTS-by-TTS 2: Data-selective augmentation for neural speech synthesis using ranking support vector machine with variational autoencoder

Add code
Jun 30, 2022
Figure 1 for TTS-by-TTS 2: Data-selective augmentation for neural speech synthesis using ranking support vector machine with variational autoencoder
Figure 2 for TTS-by-TTS 2: Data-selective augmentation for neural speech synthesis using ranking support vector machine with variational autoencoder
Figure 3 for TTS-by-TTS 2: Data-selective augmentation for neural speech synthesis using ranking support vector machine with variational autoencoder
Figure 4 for TTS-by-TTS 2: Data-selective augmentation for neural speech synthesis using ranking support vector machine with variational autoencoder
Viaarxiv icon

Improved parallel WaveGAN vocoder with perceptually weighted spectrogram loss

Add code
Jan 19, 2021
Figure 1 for Improved parallel WaveGAN vocoder with perceptually weighted spectrogram loss
Figure 2 for Improved parallel WaveGAN vocoder with perceptually weighted spectrogram loss
Figure 3 for Improved parallel WaveGAN vocoder with perceptually weighted spectrogram loss
Figure 4 for Improved parallel WaveGAN vocoder with perceptually weighted spectrogram loss
Viaarxiv icon

Effective parameter estimation methods for an ExcitNet model in generative text-to-speech systems

Add code
May 21, 2019
Figure 1 for Effective parameter estimation methods for an ExcitNet model in generative text-to-speech systems
Figure 2 for Effective parameter estimation methods for an ExcitNet model in generative text-to-speech systems
Figure 3 for Effective parameter estimation methods for an ExcitNet model in generative text-to-speech systems
Figure 4 for Effective parameter estimation methods for an ExcitNet model in generative text-to-speech systems
Viaarxiv icon