Nobuaki Minematsu

Analytic Study of Text-Free Speech Synthesis for Raw Audio using a Self-Supervised Learning Model

Dec 04, 2024
Via arXiv

A Pilot Study of Applying Sequence-to-Sequence Voice Conversion to Evaluate the Intelligibility of L2 Speech Using a Native Speaker's Shadowings

Oct 03, 2024
Via arXiv

Simulating Native Speaker Shadowing for Nonnative Speech Assessment with Latent Speech Representations

Sep 19, 2024
Via arXiv

A Pilot Study of GSLM-based Simulation of Foreign Accentuation Only Using Native Speech Corpora

Jul 16, 2024
Via arXiv

Exploring Isolated Musical Notes as Pre-training Data for Predominant Instrument Recognition in Polyphonic Music

Jun 15, 2023
Via arXiv

Hierarchical Softmax for End-to-End Low-resource Multilingual Speech Recognition

Apr 08, 2022
Via arXiv

Wasserstein GAN and Waveform Loss-based Acoustic Model Training for Multi-speaker Text-to-Speech Synthesis Systems Using a WaveNet Vocoder

Jul 31, 2018
Via arXiv