Picture for Min-Jae Hwang

Min-Jae Hwang

Textless Acoustic Model with Self-Supervised Distillation for Noise-Robust Expressive Speech-to-Speech Translation

Add code
Jun 04, 2024
Viaarxiv icon

Seamless: Multilingual Expressive and Streaming Speech Translation

Add code
Dec 08, 2023
Figure 1 for Seamless: Multilingual Expressive and Streaming Speech Translation
Figure 2 for Seamless: Multilingual Expressive and Streaming Speech Translation
Figure 3 for Seamless: Multilingual Expressive and Streaming Speech Translation
Figure 4 for Seamless: Multilingual Expressive and Streaming Speech Translation
Viaarxiv icon

SeamlessM4T-Massively Multilingual & Multimodal Machine Translation

Add code
Aug 23, 2023
Figure 1 for SeamlessM4T-Massively Multilingual & Multimodal Machine Translation
Figure 2 for SeamlessM4T-Massively Multilingual & Multimodal Machine Translation
Figure 3 for SeamlessM4T-Massively Multilingual & Multimodal Machine Translation
Figure 4 for SeamlessM4T-Massively Multilingual & Multimodal Machine Translation
Viaarxiv icon

Language Model-Based Emotion Prediction Methods for Emotional Speech Synthesis Systems

Add code
Jul 01, 2022
Figure 1 for Language Model-Based Emotion Prediction Methods for Emotional Speech Synthesis Systems
Figure 2 for Language Model-Based Emotion Prediction Methods for Emotional Speech Synthesis Systems
Figure 3 for Language Model-Based Emotion Prediction Methods for Emotional Speech Synthesis Systems
Figure 4 for Language Model-Based Emotion Prediction Methods for Emotional Speech Synthesis Systems
Viaarxiv icon

TTS-by-TTS 2: Data-selective augmentation for neural speech synthesis using ranking support vector machine with variational autoencoder

Add code
Jun 30, 2022
Figure 1 for TTS-by-TTS 2: Data-selective augmentation for neural speech synthesis using ranking support vector machine with variational autoencoder
Figure 2 for TTS-by-TTS 2: Data-selective augmentation for neural speech synthesis using ranking support vector machine with variational autoencoder
Figure 3 for TTS-by-TTS 2: Data-selective augmentation for neural speech synthesis using ranking support vector machine with variational autoencoder
Figure 4 for TTS-by-TTS 2: Data-selective augmentation for neural speech synthesis using ranking support vector machine with variational autoencoder
Viaarxiv icon

Improved parallel WaveGAN vocoder with perceptually weighted spectrogram loss

Add code
Jan 19, 2021
Figure 1 for Improved parallel WaveGAN vocoder with perceptually weighted spectrogram loss
Figure 2 for Improved parallel WaveGAN vocoder with perceptually weighted spectrogram loss
Figure 3 for Improved parallel WaveGAN vocoder with perceptually weighted spectrogram loss
Figure 4 for Improved parallel WaveGAN vocoder with perceptually weighted spectrogram loss
Viaarxiv icon

Parallel waveform synthesis based on generative adversarial networks with voicing-aware conditional discriminators

Add code
Oct 27, 2020
Figure 1 for Parallel waveform synthesis based on generative adversarial networks with voicing-aware conditional discriminators
Figure 2 for Parallel waveform synthesis based on generative adversarial networks with voicing-aware conditional discriminators
Figure 3 for Parallel waveform synthesis based on generative adversarial networks with voicing-aware conditional discriminators
Figure 4 for Parallel waveform synthesis based on generative adversarial networks with voicing-aware conditional discriminators
Viaarxiv icon