Picture for Naoya Takahashi

Naoya Takahashi

SilentCipher: Deep Audio Watermarking

Add code
Jun 06, 2024
Figure 1 for SilentCipher: Deep Audio Watermarking
Figure 2 for SilentCipher: Deep Audio Watermarking
Figure 3 for SilentCipher: Deep Audio Watermarking
Figure 4 for SilentCipher: Deep Audio Watermarking
Viaarxiv icon

STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events

Add code
Jun 15, 2023
Figure 1 for STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events
Figure 2 for STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events
Figure 3 for STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events
Figure 4 for STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events
Viaarxiv icon

Iteratively Improving Speech Recognition and Voice Conversion

Add code
May 24, 2023
Viaarxiv icon

The Whole Is Greater than the Sum of Its Parts: Improving DNN-based Music Source Separation

Add code
May 13, 2023
Viaarxiv icon

Cross-modal Face- and Voice-style Transfer

Add code
Mar 01, 2023
Viaarxiv icon

Nonparallel Emotional Voice Conversion For Unseen Speaker-Emotion Pairs Using Dual Domain Adversarial Network & Virtual Domain Pairing

Add code
Feb 21, 2023
Viaarxiv icon

CLIPSep: Learning Text-queried Sound Separation with Noisy Unlabeled Videos

Add code
Dec 14, 2022
Viaarxiv icon

Hierarchical Diffusion Models for Singing Voice Neural Vocoder

Add code
Oct 18, 2022
Figure 1 for Hierarchical Diffusion Models for Singing Voice Neural Vocoder
Figure 2 for Hierarchical Diffusion Models for Singing Voice Neural Vocoder
Figure 3 for Hierarchical Diffusion Models for Singing Voice Neural Vocoder
Figure 4 for Hierarchical Diffusion Models for Singing Voice Neural Vocoder
Viaarxiv icon

DiffRoll: Diffusion-based Generative Music Transcription with Unsupervised Pretraining Capability

Add code
Oct 11, 2022
Figure 1 for DiffRoll: Diffusion-based Generative Music Transcription with Unsupervised Pretraining Capability
Figure 2 for DiffRoll: Diffusion-based Generative Music Transcription with Unsupervised Pretraining Capability
Figure 3 for DiffRoll: Diffusion-based Generative Music Transcription with Unsupervised Pretraining Capability
Figure 4 for DiffRoll: Diffusion-based Generative Music Transcription with Unsupervised Pretraining Capability
Viaarxiv icon

Leveraging Symmetrical Convolutional Transformer Networks for Speech to Singing Voice Style Transfer

Add code
Aug 26, 2022
Figure 1 for Leveraging Symmetrical Convolutional Transformer Networks for Speech to Singing Voice Style Transfer
Figure 2 for Leveraging Symmetrical Convolutional Transformer Networks for Speech to Singing Voice Style Transfer
Figure 3 for Leveraging Symmetrical Convolutional Transformer Networks for Speech to Singing Voice Style Transfer
Figure 4 for Leveraging Symmetrical Convolutional Transformer Networks for Speech to Singing Voice Style Transfer
Viaarxiv icon