Picture for Shinnosuke Takamichi

Shinnosuke Takamichi

A Transformer Framework for Simultaneous Segmentation, Classification, and Caller Identification of Marmoset Vocalization

Add code
Nov 06, 2024
Viaarxiv icon

A Neural Transformer Framework for Simultaneous Tasks of Segmentation, Classification, and Caller Identification of Marmoset Vocalization

Add code
Oct 30, 2024
Viaarxiv icon

DNN-based ensemble singing voice synthesis with interactions between singers

Add code
Sep 16, 2024
Viaarxiv icon

Text-To-Speech Synthesis In The Wild

Add code
Sep 13, 2024
Viaarxiv icon

BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec

Add code
Sep 09, 2024
Figure 1 for BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec
Figure 2 for BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec
Figure 3 for BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec
Figure 4 for BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec
Viaarxiv icon

SaSLaW: Dialogue Speech Corpus with Audio-visual Egocentric Information Toward Environment-adaptive Dialogue Speech Synthesis

Add code
Aug 13, 2024
Viaarxiv icon

J-CHAT: Japanese Large-scale Spoken Dialogue Corpus for Spoken Dialogue Language Modeling

Add code
Jul 22, 2024
Figure 1 for J-CHAT: Japanese Large-scale Spoken Dialogue Corpus for Spoken Dialogue Language Modeling
Figure 2 for J-CHAT: Japanese Large-scale Spoken Dialogue Corpus for Spoken Dialogue Language Modeling
Figure 3 for J-CHAT: Japanese Large-scale Spoken Dialogue Corpus for Spoken Dialogue Language Modeling
Figure 4 for J-CHAT: Japanese Large-scale Spoken Dialogue Corpus for Spoken Dialogue Language Modeling
Viaarxiv icon

Textless Dependency Parsing by Labeled Sequence Prediction

Add code
Jul 14, 2024
Viaarxiv icon

Who Finds This Voice Attractive? A Large-Scale Experiment Using In-the-Wild Data

Add code
Jul 05, 2024
Viaarxiv icon

Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals

Add code
Jun 25, 2024
Viaarxiv icon