Picture for Wataru Nakata

Wataru Nakata

The T05 System for The VoiceMOS Challenge 2024: Transfer Learning from Deep Image Classifier to Naturalness MOS Prediction of High-Quality Synthetic Speech

Add code
Sep 14, 2024
Viaarxiv icon

J-CHAT: Japanese Large-scale Spoken Dialogue Corpus for Spoken Dialogue Language Modeling

Add code
Jul 22, 2024
Figure 1 for J-CHAT: Japanese Large-scale Spoken Dialogue Corpus for Spoken Dialogue Language Modeling
Figure 2 for J-CHAT: Japanese Large-scale Spoken Dialogue Corpus for Spoken Dialogue Language Modeling
Figure 3 for J-CHAT: Japanese Large-scale Spoken Dialogue Corpus for Spoken Dialogue Language Modeling
Figure 4 for J-CHAT: Japanese Large-scale Spoken Dialogue Corpus for Spoken Dialogue Language Modeling
Viaarxiv icon

Building speech corpus with diverse voice characteristics for its prompt-based representation

Add code
Mar 20, 2024
Viaarxiv icon

UTDUSS: UTokyo-SaruLab System for Interspeech2024 Speech Processing Using Discrete Speech Unit Challenge

Add code
Mar 20, 2024
Viaarxiv icon

Coco-Nut: Corpus of Japanese Utterance and Voice Characteristics Description for Prompt-based Control

Add code
Sep 24, 2023
Viaarxiv icon

UTMOS: UTokyo-SaruLab System for VoiceMOS Challenge 2022

Add code
Apr 05, 2022
Figure 1 for UTMOS: UTokyo-SaruLab System for VoiceMOS Challenge 2022
Figure 2 for UTMOS: UTokyo-SaruLab System for VoiceMOS Challenge 2022
Figure 3 for UTMOS: UTokyo-SaruLab System for VoiceMOS Challenge 2022
Figure 4 for UTMOS: UTokyo-SaruLab System for VoiceMOS Challenge 2022
Viaarxiv icon

J-MAC: Japanese multi-speaker audiobook corpus for speech synthesis

Add code
Jan 26, 2022
Figure 1 for J-MAC: Japanese multi-speaker audiobook corpus for speech synthesis
Figure 2 for J-MAC: Japanese multi-speaker audiobook corpus for speech synthesis
Figure 3 for J-MAC: Japanese multi-speaker audiobook corpus for speech synthesis
Figure 4 for J-MAC: Japanese multi-speaker audiobook corpus for speech synthesis
Viaarxiv icon