Picture for Tomohiko Nakamura

Tomohiko Nakamura

Discrete Speech Unit Extraction via Independent Component Analysis

Add code
Jan 11, 2025
Viaarxiv icon

DNN-based ensemble singing voice synthesis with interactions between singers

Add code
Sep 16, 2024
Viaarxiv icon

Physics-Informed Machine Learning For Sound Field Estimation

Add code
Aug 27, 2024
Figure 1 for Physics-Informed Machine Learning For Sound Field Estimation
Figure 2 for Physics-Informed Machine Learning For Sound Field Estimation
Figure 3 for Physics-Informed Machine Learning For Sound Field Estimation
Figure 4 for Physics-Informed Machine Learning For Sound Field Estimation
Viaarxiv icon

Self-Supervised Speech Representations are More Phonetic than Semantic

Add code
Jun 12, 2024
Figure 1 for Self-Supervised Speech Representations are More Phonetic than Semantic
Figure 2 for Self-Supervised Speech Representations are More Phonetic than Semantic
Figure 3 for Self-Supervised Speech Representations are More Phonetic than Semantic
Figure 4 for Self-Supervised Speech Representations are More Phonetic than Semantic
Viaarxiv icon

Neural Blind Source Separation and Diarization for Distant Speech Recognition

Add code
Jun 12, 2024
Viaarxiv icon

Real-time Speech Extraction Using Spatially Regularized Independent Low-rank Matrix Analysis and Rank-constrained Spatial Covariance Matrix Estimation

Add code
Mar 19, 2024
Viaarxiv icon

Sampling-Frequency-Independent Universal Sound Separation

Add code
Sep 22, 2023
Viaarxiv icon

Algorithms of Sampling-Frequency-Independent Layers for Non-integer Strides

Add code
Jun 19, 2023
Figure 1 for Algorithms of Sampling-Frequency-Independent Layers for Non-integer Strides
Figure 2 for Algorithms of Sampling-Frequency-Independent Layers for Non-integer Strides
Figure 3 for Algorithms of Sampling-Frequency-Independent Layers for Non-integer Strides
Figure 4 for Algorithms of Sampling-Frequency-Independent Layers for Non-integer Strides
Viaarxiv icon

How Generative Spoken Language Modeling Encodes Noisy Speech: Investigation from Phonetics to Syntactics

Add code
Jun 01, 2023
Figure 1 for How Generative Spoken Language Modeling Encodes Noisy Speech: Investigation from Phonetics to Syntactics
Figure 2 for How Generative Spoken Language Modeling Encodes Noisy Speech: Investigation from Phonetics to Syntactics
Figure 3 for How Generative Spoken Language Modeling Encodes Noisy Speech: Investigation from Phonetics to Syntactics
Figure 4 for How Generative Spoken Language Modeling Encodes Noisy Speech: Investigation from Phonetics to Syntactics
Viaarxiv icon

jaCappella Corpus: A Japanese a Cappella Vocal Ensemble Corpus

Add code
Dec 09, 2022
Viaarxiv icon